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PROTOPORPHYRINOGEN OXIDASE ("PROTOX") GENES 

The present invention relates to DNA molecules encoding protoporphyrinogen 
oxidase ("protox") enzymes and t€> DNA molecules encoding herbicide-tolerant forms of the 
enzyme protoporphyrinogen oxidase ("protox"). The invention further relates to herbicide- 
tolerant plants as well as methods for tissue culture selection and herbicide application 
based on these herbicide-tolerant forms of protox. 

I. The Protox Enzyme and its Involvement in the Chlorophyll/Heme Biosynthetic Pathway 

The biosynthetic pathways that lead to the production of chlorophyll and heme share 
a number of common steps. Chlorophyll is a light harvesting pigment present in all green 
photosynthetic organisms. Heme is a cofactor of hemoglobin, cytochromes, P450 mixed- 
function oxygenases, peroxidases, and catalyses (see, e.g. Lehninger, Biochemistry, Worth 
Publishers, New York (1975)), and is therefore a necessary component for all aerobic 
organisms. 

The last common step in chlorophyll and heme biosynthesis is the oxidation of 
protoporphyrinogen IX to protoporphyrin IX. Protoporphyrinogen oxidase (referred to herein 
as "protox") is the enzyme that catalyzes this last oxidation step (Matringe et al,, Biochem. 
J. 260:231 (1989)). 

The protox enzyme has been purified either partially or completely from a number of 
organisms including the yeast Saccharomyces cerevisiae (Labbe-Bois and Labbe, In 
Biosynthesis of Heme and Chlorophyll, E.H. Dailey, ed. McGraw Hill: New York, pp. 235-285 
(1990)), barley etloplasts (Jacobs and Jacobs, Biochem, J. 244: 219 (1987)), and mouse 
liver (Dailey and Karr, Biochem, 26: 2697 (1 987)). Genes encoding protox have been 
isolated from two prokaryotic organisms, Escherichia co// (Sasarman etal., Can. J. 
Microbiol. 39: 1 1 55 (1 993)) and Bacillus subtilis (Dailey et aL, J. Biol. Chem. 269: 81 3 
(1994)). These genes share no sequence similarity; neither do their predicted protein 
products share any amino acid sequence identity. The £ co// protein is approximately 21 
kDa, and associates with the cell membrane. The B. subtilis protein is 51 kDa, and is a 
soluble, cytoplasmic activity. 

Protox encoding genes have now also been isolated from humans (see Nishimura et 
al., J. BioL Chem, 270(14): 8076-8080 (1995) and plants (International application no. 
PCT/1B95/00452 filed June 8, 1995, published Dec. 21. 1995 as WO 95/34659). 

II. The Protox Gene as a Herbicide Target 
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The use of herbicides to control undesirable vegetation such as weeds or plants in 
crops has become an almost universal practice. The relevant market exceeds a billion 
dollars annually. Despite this extensive use, weed control remains a significant and costly 
problem for farmers. 

Effective use of herbicides requires sound management: For instance, time and 
method of application and stage of weed plant development are critical to getting good 
weed control with herbicides. Since various weed species are resistant to herbicides the 
production of effective herbicides becomes increasingly important. Novel herbicides can^ 
now be discovered using high-throughput screens that implement recombinant DNA 
technology. Metabolic enzymes essential to plant growth and development can be 
recombinantly produced though standard molecular biological techniques and utilized as 
herbicide targets in screens for novel inhibitors of the enzymes' activity. The novel inhibitors 
discovered through such screens may then be used as herbicides to control undesirable 
vegetation. 

Unfortunately, herbicides that exhibit greater potency, broader weed spectrum and 
more rapid degradation in soil can also have greater crop phytotoxicity. One solution 
applied to this problem has been to develop crops that are resistant or tolerant to 
herbicides. Crop hybrids or varieties resistant to the herbicides allow for the use of the 
herbicides without attendant risk of damage to the crop. Development of resistance can 
allow application of a herbicide to a crop where its use was previously precluded or limited 
(e.g. to pre-emergence use) due to sensitivity of the crop to the heriDicide. For example, 
U.S. Patent No. 4,761,373, incorporated herein'by reference, is directed to plants resistant 
to various imidazolinone or sulfonamide herbicides. The resistance is conferred by an 
altered acetphydroxyacid synthase (AHAS) enzyme. U.S. Patent No. 4,975,374, 
incorporated herein by reference, relates to plant cells and plants containing a gene 
encoding a mutant glutamine synthetase (GS) resistant to inhibition by herbicides that were 
known to inhibit GS, e.g. phosphinothricin and methionine sulfoximine. U.S. Patent No. 
5.013,659. incorporated herein by reference, is directed to plants that express a mutant 
acetolactate synthase (ALS) that renders the plants resistant to inhibition by sulfonylurea 
herbicides. U.S. Patent No. 5,162.602, incorporated herein by reference, discloses plants 
tolerant to inhibition by cyclohexanedione and aryloxyphenoxypropanoic acid heriDicides. 
The tolerance is confen-ed by an altered acetyl coenzyme A carboxylase{ACCase). U.S. 
Patent No. 5,554,798, incorporated herein by reference, discloses transgenic glyphosate 
resistant maize plants, which tolerance is conferred by an altered 5-enolpymvyl-3- 
phosphoshikimate (EPSP) synthase gene. 
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The protox enzyme serves as the target for a variety of herbicidal compounds. The 
herbicides that inhibit protox include many different structural classes of molecules (Duke et 
ai, Weed Sci 39: 465 (1991); Nandihalli et aL, Pesticide Biochem. PliysioL 43: 1 93 (1 992); 
Matringe a/.. FEBS Lett 245: 35 (1989); Yanase and Andoh, Pesticide Biochem. Physiol. 
35: 70 (1989)). These herbicidal compounds include the diphenylethers {e.g. acifluorfen, 5- 
[2-chloro-4-(trif luoromethyl)phenoxy]-2-nitrobezoic acid; its methyl ester; or oxyfluorfen, 2- 
chIoro-1-(3-ethoxy-4-nitrophenoxy)-4-(trifluorobenzene)}, oxidiazoles, (e.g. oxidiazon, 3-[2,4- 
dichloro-5-(1-methylethoxy)phenyl]-5-(1 ,1 -dimethylethyl)-1 ,3,4-oxadiazol-2-(3H)-one), cyclic 
imides (e.g. S-231 42, A/-(4-chIoro-2-fluoro-5-propargyloxyphenyl)-3,4,5,6- 
tetrahydrophthalimide; chlorophthalim, A/-(4-chlorophenyl)-3,4,5,6-tetrahydrophthalimide), 
phenyl pyrazoles (e.g. TNPP-ethyl, ethyl 2-[1-(2,3,4-trichlorophenyl)-4-nitropyrazolyl-5- 
oxyjpropionate; M&B 39279). pyridine derivatives (e.g. LS 82-556), and phenopylate and its 
Ophenylpyrrolidino- and piperidinocarbamate analogs. Many of these compounds 
competitively inhibit the normal reaction catalyzed by the enzyme, apparently acting as 
substrate analogs. 

Typically, the inhibitory effect on protox is determined by measuring fluorescence at 
about 622 to 635 nm. after excitation at about 395 to 410 nM (see, e.g. Jacobs and Jacobs, 
Enzyme 28: 206 (1 982); Sherman et al.. Plant Physiol 97: 280 (1 991 )). This assay is 
based on the fact that protoporphyrin IX is a fluorescent pigment, and protoporphyrinogen 
IX is nonfluorescent. 

The predicted mode of action of protox-inhibitlng herbicides involves the 
accumulation of protoporphyrinogen IX in the chloroplast. This accumulation is thought to 
lead to leakage of protoporphyrinogen IX into the cytosol where it is oxidized by a 
peroxidase activity to protoporphyrin IX. When exposed to light, protoporphyrin IX can 
cause formation of singlet oxygen in the cytosol. This singlet oxygen can in turn lead to the 
formation of other reactive oxygen species, which can cause lipid peroxidation and 
membrane disruption leading to rapid cell death (Lee ef a/., Plant Physiol, 102: 881 (1993)). 

Not all protox enzymes are sensitive to herbicides that inhibit plant protox enzymes. 
Both of the protox enzymes encoded by genes isolated from Escherichia coli (Sasamnan et 
aL, Can, J. Microbiol 39: 1 155 (1993)) and Bacillus subtilis (Dailey et al, J. Biol Chem, 
269: 813 (1994)) are resistant to these herbicidal inhibitors. In addition, mutants of the 
unicellular alga Chlamydomonas reinhardtii resislant to the phenylimide herbicide S-231 42 . 
have been reported (Kataoka etal, J. Pesticide Scl 15:449 (1990); Shibata ef a/., In 
Research in Photosynthesis, Vol. 111. N. Murata, ed. KluwenNetherlands, pp. 567-570 
(1992)). At least one of these mutants appears to have an altered protox activity that is 
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resistant not only to the herbicidal inhibitor on which the mutant was selected, but also to 
other classes of protox inhibitors (Oshio etaL, Z Naturforsch. 48c: 339 (1 993); Sato et ai. 
In ACS Symposium on Porphyric Pesticides, S. Duke, ed. ACS Press: Washington, D.C. 
(1994)). A mutant tobacco cell line has also been reported that is resistant to the inhibitor 
S-21432 (Che ef a/.,. Z Naturforscf), 48c:350 (1993). 

111. Plastid Transfomnation and Expression 

Plastid transformation, in which genes are inserted by homologous recombination 
into some or all of the several thousand copies of the circular plastid genome present in 
each plant cell, takes advantage of the enormous copy number advantage over nuclear- 
expressed genes to pemiit expression levels that may exceed 10% of the total soluble plant 
protein. In addition, plastid transfomnation is desirable because in most plants plastid- 
encoded traits are not pollen transmissible; hence, potential risks of inadvertent transgene 
escape to wild relatives of transgenic plants is obviated. Plastid transformation technology is 
extensively described in U.S. Patent Nos. 5.451.513, 5,545,817, 5,545,818, and 5,576,198; 
in PCT Application Nos. WO 95/16783 and WO 97/32977; and in McBride etaL, Proc. Natl, 
Acad ScL USA 91: 7301-7305 (1994). all of which are incorporated herein by reference. 
Plastid transformation via biolistics was achieved initially In the unicellular green alga 
Clilamydomonas reinhardtii {Bqynton etaL (1988) Science 240: 1534-1537, incorporated 
herein by reference) and this approach, using selection for c/s-acting antibiotic resistance 
loci (spectinomycin/streptomycin resistance) or complementation of non-photosynthetic 
mutant phenotypes. was soon extended to Nicotiana tabacum {S\/ab etaL (1990) Proc. 
NatL Acad. Sc/. USA 87: 8526-8530, incorporated herein by reference). 

The basic technique for tobacco chloroplast transfomnation involves the particle 
bombardment of leaf tissue or PEG-mediated uptake of plasmid DNA in protoplasts with 
regions of cloned plastid DNA flanking a selectable antibiotic resistance marker. The 1 to 
1.5 kb flanking regions, tended "targeting sequences," facilitate homologous recombination 
with the plastid genome and thus allow the replacement or modification of specific regions 
of the. 156 kb tobacco plastid genome. Initially, point mutations in the chloroplast 16S rDNA 
and rps12 genes confening resistance to spectinomycin and/or streptomycin were utilized 
as selectable markers for transformation (Svab, Z., Hajdukiewicz, P.. and Maliga. P. (1990) 
Proc. Natl. Acad. Sci. USA 87, 8526-8530; Staub. J. M.. and Maliga, P. (1992) Plant Cell 4, 
39-45, incorporated herein by reference). This resulted in stable homoplasmic 
transfonnants at a frequency of approximately one per 100 bombardments of target leaves. 
The presence of cloning sites between these markers allowed creation of a plastid targeting 
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vector for introduction of foreign genes (Staub, J.M., and Maliga, P., EMBO J, 12: 601-606 
(1993), incorporated herein by reference). Substantial increases in transformation 
frequency were obtained by replacement of the recessive rRNA or r-protein antibiotic 
resistance genes with a dominant selectable marker, the bacterial aadA gene encoding the 
spectinomycin-detoxifying enzyme aminoglycoside-y-adenyltransferase (Svab, Z., and 
Maliga, P. (1993) Proa NatL Acad, ScL USA 90, 913-917, incorporated herein by 
reference). Previously, this marker had been used successfully for high-frequency 
transformation of the plastid genome of the green alga Chlamydomonas reinhardtii 
(Goldschmidt-Clermont, M. (1991) NucL Acids Res, 19, 4083-4089, incorporated herein by 
reference). Recently, plastid transformation of protoplasts from tobacco and the moss 
Physcomitrella patens has been attained using polyethylene glycol (PEG) mediated DNA 
uptake (O'Neill etal. (1993) Plant J. 3: 729-738; Koop etaL (1996) Planta 199: 193-201, 
both of which are incorporated herein by reference). 

The present invention provides DNA molecules isolated from maize and comprising 
the coding sequence of an enzyme having protoporphyrinogen oxidase (protox) activity. 
In particular, the coding sequence of a mature maize protoporphyrinogen oxidase is set 
forth in SEQ ID NO: 44, and the corresponding amino acid sequence for a mature maize 
protoporphyrinogen oxidase enzyme is set forth In SEQ ID NO:45. The present invention 
further provides chimeric genes comprising such DNA molecules. 

The present invention also provides DNA molecules isolated from wheat, soybean, 
cotton, sugar beet, oilseed rape, rice, sorghum, and sugar cane encoding enzymes having 
protoporphyrinogen oxidase (protox) activity and chimeric genes comprising such DNA. 
Sequences of such DNA molecules are set forth in SEQ ID NOs: 9 (wheat), 11 (soybean), 
15 (cotton), 17 (sugar beet), 19 (oilseed rape), 21 (rice), 23 (sorghum), and 36 (sugar cane). 

The present invention also provides modified fonms of plant protoporphyrinogen 
oxidase (protox) enzymes that are resistant to compounds that inhibit unmodified naturally 
occurring plant protox enzymes, and DNA molecules coding for such inhibitor-resistant plant 
protox enzymes. Thus, in one aspect the present invention provides a DNA molecule 
encoding a plant protox enzyme that is capable of being incorporated into a DNA constmct 
used to transfomn a plant containing wild-type, hert^icide-sensitive protox, wherein the DNA 
molecule has at least one point mutation relative to a wild-type DNA molecule encoding 
plant protox such that upon transfomnation with the DNA construct the plant contains the 
DNA molecule, which renders the plant resistant to the application of a herbicide that 
inhibits naturally occurring plant protox. The present invention includes chimeric genes and 
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modified forms of naturally occurring protox genes that can express the inhibitor-resistant 
plant protox enzymes in plants. 

Genes encoding inhibitor-resistant plant protox enzymes can be used to confer 
resistance to protox-inhibitory herbicides in whole plants and as a selectable marker in plant 
cell transformation methods. Accordingly, the present invention also includes plants, 
including the descendants thereof, plant tissues and plant seeds containing plant 
expressible genes encoding these modified protox enzymes. These plants, plant tissues 
and plant seeds are resistant to protox-inhibitors at levels that normally are inhibitory to the 
naturally occurring protox activity in the plant. Plants encompassed by the invention 
especially include those that would be potential targets for protox inhibiting herbicides, 
particularly agronomically important crops such as maize and other cereal crops such as 
bariey, wheat, sorghum, rye, oats, turf and forage grasses, millet and rice. Also comprised 
are other crop plants such as sugar cane, soybean, cotton, sugar beet, oilseed rape and 
tobacco. 

The present invention accordingly provides a method for selecting plant cells 
trahsfonned with a DNA molecule of the invention that encodes a herbicide-tolerant form of 
plant protox. The method comprises introducing the DNA molecule into plant ceils whose 
growth is sensitive to inhibition by herbicides to which the protox encoded by the DNA 
molecule is resistant, thus forming a transformed plant cell. The transformed plant cell 
whose growth is resistant to the selected herbicide is identified by selection at a hertDicide 
concentration that inhibits the growth of untransfomied plant cells. 

The present invehtiori is directed further to rriethbds for the prqduction of plants, 
including plant material, such as for example plant tissues, protoplasts, cells, calli, organs, 
plant seeds, embryos, pollen, egg cells, zygotes, together with any other propagating 
material and plant parts, such as for example flowers, stenris, fruits, leaves, roots originating 
in transgenic plants or their progeny previously transformed by means of the process of the 
invention, which produce an inhibitor-resistant form of the plant protox enzyme provided 
herein. Such plants may be stably transformed with a structural gene encoding the 
resistant protox, or prepared by direct.selection techniques whereby herbicide resistant lines 
are. isolated, characterized and developed. 

In another aspect, the present invention is directed to a method for controlling 
unwanted vegetation growing at a locus where a herbicide-tolerant, agronomically useful 
plant, which is transfomied with a DNA molecule according to the present invention that 
encodes a herbicide-tolerant form of plant protox, has been cultivated. The method 
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comprises applying to the locus to be protected an effective amount of lierbicide that 
inhibits naturally occurring protox activity. 

The present invention is further directed to probes and methods for detecting the 
presence of genes encoding inhibitor-resistant ioms of the plant protox enzyme and 
quantitating levels of inhibitor-resistant protox transcripts in plant tissue. These methods 
may be used to identify or screen for plants or plant tissue containing and/or expressing a 
gene encoding an inhibitor-resistant form of the plant protox enzyme. 

The present invention also relates to plastid transformation and to the expression of 
DNA molecules in a plant plastid. In a preferred embodiment, a native plant protox enzyme 
or a modified plant protox enzyme is expressed in plant plastids to obtain herbicide resistant 
plants. 

In a further embodiment, the present invention is directed to a chimeric gene 
comprising: (a) a DNA molecule isolated from a plant, which in its native state encodes a 
polypeptide that comprises a plastid transit peptide, and a mature enzyme that is natively 
targeted to a plastid of the plant by the plastid transit peptide, wherein the DNA molecule is 
modified such that it does not encode a functional plastid transit peptide; and (b) a promoter 
capable of expressing the DNA molecule in a plastid, wherein the promoter is operatively 
linked to the DNA molecule. The DNA molecule may be modified in that at least a portion of 
the native plastid transit peptide coding sequence is absent from the DNA molecule. 
Alternatively, the DNA molecule may be modified in that one or more nucleotides of the 
native plastid transit peptide coding sequence are mutated, thereby rendering an encoded 
plastid transit peptide nonfunctional. The present invention also relates to plants 
homoplasmic for chloroplast genomes containing such chimeric genes. In a preferred 
embodiment, the DNA molecule encodes an enzyme that is naturally inhibited by a 
herbicidal compound. In this case, such plants are resistant to a herbicide that naturally 
inhibits the enzyme encoded by a DNA molecule according to the present invention. 

The present invention is also directed to plants made resistant to a herbicide by 
transforming their plastid genome with a DNA molecule according to the present invention 
and to methods for obtaining such plants. In a preferred embodiment, the DNA molecule 
encodes an enzyme that is naturally inhibited by a herbicidal compound. In a more 
preferred embodiment, the DNA molecule encodes an enzyme having protoporphyrinogen 
oxidase (protox) activity, which is modified so that it that confers resistance to protox 
inhibitors. A further embodiment of the present invention is directed to a method for 
controlling the growth of undesired vegetation, which comprises applying to a population of 
the above-described plants an effective amount of an inhibitor of the enzyme. 
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The present invention also provides a novel method for selecting a transplastomic 
plant cell, comprising the steps of: introducing the above-described chimeric gene into the 
plastome of a plant cell; expressing the encoded enzyme in the plastids of the plant cell; 
and selecting a cell that is resistant to a herbicidal compound that naturally inhibits the 
activity of the enzyme, whereby the resistant cell comprises transfomned plastids. In a 
preferred embodiment, the enzyme is naturally inhibited by a herbicidal compound and the 
transgenic plant is able to grow on an amount of the herbicidal compound that naturally 
inhibits the activity of the enzyme. In a further preferred embodiment, the enzyme has 
protoporphyrinogen oxidase (protox) activity and is modified so that it that confers 
resistance to protox inhibitors. 

Brief description of the Sequence Listing: 

SEQ ID N0:1 : DNA coding sequence for an Arabidopsis thaliana protox-1 protein. 

SEQ ID N0:2: Arabidopsis protox-1 amino acid sequence encoded by SEQ ID NO:1. 

SEQ ID N0:3: DNA coding sequence for an Arabidopsis thaliana protox-2 protein. 

SEQ ID NO'A: Arabidopsis protox-2 amino acid sequence encoded by SEQ ID NO:3. 

SEQ ID N0:5: Partial DNA coding sequence for a maize protox-1 protein. 

SEQ ID N0:6: Partial maize protox-1 amino acid sequence encoded by SEQ ID N0:5. 

SEQ ID N0:7: DNA coding sequence for a maize protox-2 protein. 

SEQ ID N0:8: Maize protox-2 amino acid sequence encoded by SEQ ID N0:7. 

SEQ ID N0:9: Partial DNA coding sequence for a wheat protox-1 protein. 

SEQ ID N0:10: Partial wheat protox-1 amino acid sequence encoded by SEQ ID N0:9. 

SEQ ID N0:1 1 : DNA coding sequence for a soybean protox-1 protein. 

SEQ ID N0:12: Soybean protox-1 protein encoded by SEQ ID NO:11. 

SEQ ID N0:13: Promoter sequence from Arabidopsis ttialiana protox-1 gene. 

SEQ ID N0:14: Promoter sequence from maize protox-1 gene. 

SEQ ID N0:15: DNA coding sequence for a cotton protox-1 protein. 

SEQ ID N0:16: Cotton protox-1 amino acid sequence encoded by SEQ ID N0:15. 

SEQ ID N0:17: DNA coding sequence for a sugar beet protox-1 protein. 

SEQ ID N0:18: Sugar beet protox-1 amino acid sequence encoded by SEQ ID NO:17. 

SEQ ID N0:19: DNA coding sequence for an oilseed rape protox-1 protein. 

SEQ ID N0:20: Oilseed rape protox-1 amino acid sequence encoded by SEQ ID N0:19. 

SEQ ID N0:21 : Partial DNA coding sequence for a rice protox-1 protein. 

SEQ ID NO:22: Partial rice protox-1 amino acid sequence encoded by SEQ ID N0:21 . 

SEQ ID NO:23: Partial DNA coding sequence for a sorghum protox-1 protein. 



8 



wo 01/68826 PCT/EPOl/02753 

SEQ ID N0:24; Partial sorghum protox-1 amino acid sequence encoded by SEQ ID NO:23. 

SEQ ID NO:25: Maize protox-1 intron sequence. 

SEQ ID NO:26: Promoter sequence from sugar beet protox-1 gene. 

SEQ ID N0:27: Pclp„P1a - plastid c/pP gene promoter top strand PGR primer. 

SEQ ID N0:28: Pclp_P1b - plastid cIpP gene promoter bottom strand PGR primer. 

SEQ ID NO:29: Pclp„P2b - plastid cIpP gene promoter bottom strand PGR primer. 

SEQ ID NO:30: Trps16„P1a - plastid rps16 gene top strand PGR primer. 

SEQ ID N0:31: Trps16_p1b - plastid rps76 gene bottom strand PGR primer. 

SEQ ID N0:32: minpsb„U - plastid psM gene top strand primer. 

SEQ ID NO:33: minpsb_L - plastid psM gene bottom strand primer. 

SEQ ID NO:34: APRTXPIa - top strand PGR primer. 

SEQ ID NO:35: APRTXPIb - bottom strand PGR primer. 

SEQ ID NO:36: Partial DNA coding sequence for a sugar cane protox-1 protein. 

SEQ ID NO:37: Partial sugar cane protox-1 amino acid sequence encoded by SEQ ID NO:36. 

SEQ ID NO:38: Sub-sequence #1 (APAiF). 

SEQ ID NO:39: Sub-sequence #8 (YIGGA3). 

SEQ ID NO:40: Sub-sequence #12 (IGGA12). 

SEQ ID N0:41 : Sub-sequence #13 (SWXLA13). 

SEQ ID NO:42: Sub-sequence #15 (GA15XGL). 

SEQ ID NO:43: Sub-sequence #17 (YVAnG). 

SEQ ID N0:44: DNA coding sequence for a mature maize protox-1 protein. 
SEQ ID NO:45: Mature maize protox-1 amino acid sequence encoded by SEQ ID NO:44. 
SEQ ID NO:46: Predicted coding sequence for a maize protox-1 polypeptide precursor. 
SEQ ID NO:47: Predicted amino acid sequence of a maize protox-1 polypeptide precursor 

encoded by SEQ ID NO:46. 

The following vector molecules have been deposited with Agricultural Research 
Service, Patent Gulture Gollection (NRRL), Northern Regional Research Genter, 1815 North 
University Street, Peoria, Illinois 61604, U.S.A on the dates indicated below: 

Wheat protox-1 a, in the pBluescript SK vector, was deposited March 19, 1996, as 
pWDG-13 (NRRL #621545). 

Soybean protox-1, in the pBluescript SK vector, was deposited December 15, 1995 
as pWDC-12 (NRRL #8-21516). 
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Cotton protox-1, in the pBluescript SK vector, was deposited July 1, 1996 as pWDC- 
15{NRRL#B-21594). 

Sugar beet protox-1 , in the pBluescript SK vector, was deposited July 29, 1 996, as 
pWDC-1 6 (NRRL #B-21 595N). 

Oilseed rape protox-1 , in the pBluescript SK vector, was deposited August 23, 1996, 
as pWDC-17 (NRRL #B-21615). 

Rice protox-1 , in the pBluescript SK vector, was deposited December 6, 1996, as 
pWDC-18 (NRRL #B-21648). 

Sorghum protox-1 , in the pBluescript SK vector, was deposited December 6, 1996, 
as pWDC-19 (NRRL #B-21649). 

Resistant mutant pAraC-2Cys, in the pMut-1 plasmid, was deposited on November 
14, 1994 under the designation pWDC-7 with the Agricultural Research Culture Collection 
and given the deposit designation NRRL #21 339N, 

AraPTI Pro containing the Arabidopsis protox-1 promoter was deposited December 
15, 1995, as pWDC-11 (NRRL #B-21515) 

^ A plasmid containing the maize protox-1 promoter fused to the remainder of the 
maize protox-1 coding sequence was deposited March 19, 1996 as pWDC-14 (NRRL #B- 
21546). 

A plasnriid containing the sugar beet protox-1 promoter was deposited December 6, 
1996, as pWDC-20 (NRRL #B-21 650). 

For clarity, certain terms used in the specification are defined and presented as 

follows: 

Associated With / Operatively Linked: refers to two DNA sequences that are related 
physically or functionally. For example, a promoter or regulatory DNA sequence is said to 
be "associated with" a DNA sequence that codes for an RNA or a protein if the two 
sequences are operatively linked, or situated such that the regulator DNA sequence will 
affect the expression level of the coding or structural DNA sequence. 

Chimeric Gene: a recombinant DNA sequence in which a promoter or regulatory 
DNA sequence is operatively linked to. or associated with, a DNA sequence that codes for 
an mRNA or which is expressed as a protein, such that the regulator DNA sequence is able 
to regulate transcription or expression of the associated DNA sequence. The regulator 
DNA sequence of the chimeric gene is not normally operatively linked to the associated 
DNA sequence as found in nature. 
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Coding DNA Sequence: a DNA sequence that is translated in an organism to 
produce a protein. 

Corresponding To: in the context of the present invention, "corresponding to" means 
that when the amino acid sequences of various protox enzymes are aligned with each 
other, such as in Table 1A, the amino acids that "correspond to" certain enumerated 
positions in Table 1 A are those that align with these positions in Table 1A, but that are not 
necessarily in these exact numerical positions relative to the particular protox enzyme's 
amino acid sequence. Likewise, when the amino acid sequence of a particular protox 
enzyme (for example, the soybean protox enzyme) is aligned with the amino acid sequence 
of a reference protox enzyme (for example, the Arabidopsis protox-1 sequence given in 
SEQ ID NO:2), the amino acids in the soybean protox sequence that "con'espond to" certain 
enumerated positions of SEQ ID NO:2 are those that align with these positions of SEQ ID 
N0:2, but are not necessarily in these exact numerical positions of the soybean protox 

enzyme's amino acid sequence. 

Hert3icide: a chemical substance used to kill or suppress the growth of plants, plant 

cells, plant seeds, or plant tissues. 

Heterologous DNA Sequence: a DNA sequence not naturally associated with a host 
cell into which it is introduced, including non-naturally occurring multiple copies of a 
naturally occurring DNA sequence. 

Homologous DNA Sequence: a DNA sequence naturally associated with a host cell 

into which it is introduced. 

Homoplasmic: refers to a plant, plant tissue or plant cell, wherein all of the plastids 
are genetically identical. In different tissues or stages of developnnent, the plastids may take 
different forms, e.g., chloroplasts, proplastlds, etioplasts, amyloplasts, chromoplasts, and so 
forth. 

Inhibitor: a chemical substance that inactivates the enzymatic activity of a protein 
such as a biosynthetic enzyme, receptor, signal transduction protein, structural gene 
product, or transport protein that is essential to the growth or survival of the plant. In the 
context of the instant invention, an inhibitor is a chemical substance that inactivates the 
enzymatic activity of protox. The tenm "herbicide" is used herein to define an inhibitor when 
applied to plants, plant cells, plant seeds, or plant tissues. 

Isolated: in the context of the present invention, an isolated nucleic acid molecule or 
an isolated enzyme is a nucleic acid molecule or enzyme that, by the hand of man, exists 
apart from its native environment and is therefore not a product of nature. An isolated 
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nucleic acid molecule or en2yme may exist in a purified fomn or may exist in a non-native 
environment such as, for example, a transgenic host cell. 

Minimal Promoter: promoter elements, particularly a TATA element, that are inactive 
or that have greatly reduced promoter activity in the absence of upstream activation. In the 
presence of a suitable transcrption factor, the minimal promoter functions to permit 
transcription. 

Modified Enzyme Activity: enzyme activity different from that which naturally occurs 
in a plant (i.e. enzyme activity that occurs naturally in the absence ofdirect or indirect 
manipulation of such activity by man), which is tolerant to inhibitors that inhibit the naturally 
occurring enzyme activity. 

Nucleic Acid Molecule: a linear segment of single- or double-stranded DNA or RNA 
that can be isolated from any source. In the context of the present invention, the nucleic 
acid molecule is preferably a segment of DNA. 

Plant: refers to any plant or part of a plant at any stage of development. Therein are 
also included cuttings, cell or tissue cultures and seeds. As used in conjunction with the 
present invention, the term '"plant tissue" includes, but is not limited to, whole plants, plant 
cells, plant organs, plant seeds, protoplasts, callus, cell cultures, and any groups of plant 
cells organized into structural and/or functional units. 

Plastome: the genome of a plastid. 

Protox-1 : chloroplast protox. 

Protox-2: mitochondrial protox. 

Significant Increase: an increase in enzymatic activity that is larger than the margin 
of error inherent in the measurement technique, preferably an increase by about 2-fold or 
greater of the activity of the wild-type enzyme in the presence of the inhibitor, more 
preferably an increase by about 5-fold or greater, and most preferably an increase by about 
10-fold or greater. 

Substantially Similar: with respect to nucleic acids, a nucleic acid molecule that has 
at least 60 percent sequence identity with a reference nucleic acid molecule. In a prefen-ed 
embodiment, a substantially similar DNA sequence is at least 80% identical to a reference 
DNA sequence; in a more prefen^ed embodiment, a substantially similar DNA sequence is at 
least 90% identical to a reference DNA sequence; and in a most preferred embodiment, a 
substantially similar DNA sequence is at least 95% identical to a reference DNA sequence. 
A substantially similar nucleotide sequence typically hybridizes to a reference nucleic acid 
molecule, or fragments thereof, under the following conditions: hybridization at 7% sodium 
dodecyl sulfate (SDS), 0.5 M NaP04 pH 7.0, 1 mM EDTA at 50°C; wash with 2X SSC, 1% 
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SDS, at 50°C. With respect to proteins or peptides, a substantially similar amino acid 
sequence is an amino acid sequence that is at least 90% identical to the amino acid 
sequence of a reference protein or peptide and has substantially the same activity as the 

reference protein or peptide. 

Tolerance / Resistance: the ability to continue normal growth or function when 

exposed to an inhibitor or herbicide. 

Transformation: a process for introducing heterologous DNA into a cell, tissue, or 
plant. Transformed cells, tissues, or plants are understood to encompass not only the end 
product of a transformation process, but also transgenic progeny thereof. 

Transit Peptide: a signal polypeptide that is translated in conjunction with a protein 
encoded by a DNA molecule, forming a polypeptide precursor. In the process of transport to 
a selected site within the cell, a chloroplast for example, the transit peptide can be cleaved 
from the remainder of the polypeptide precursor to provide an active or mature protein. 

Transformed: refers to an organism such as a plant into which a heterologous DNA 
molecule has been introduced. The DNA molecule can be stably integrated into the genome 
of the plant, wherein the genome of the plant encompasses the nuclear genome, the plastid 
genome and the mitochondrial genome. In a transformed plant, the DNA molecule can also 
be present as an extrachromosomal molecule. Such an extrachromosomal molecule can be 
auto-replicating. A "non-transfonned" plant refers to a wild-type organism, i.e., a plant, 
which does not contain the heterologous DNA molecule. 
Transplastome: a transfomried plastid genome. 

Nucleotides are indicated by their bases by the following standard abbreviations: 
adenine (A), cytosine (C), thymine (T), and guanine (G). Amino acids are likewise indicated 
by the following standard abbreviations: alanine (ala; A), arginine (Arg; R), asparagine (Asn; 
N) . aspartic acid (Asp; D), cysteine (Cys; C), glutamine (Gin; Q), glutamic acid (Glu; E), 
glycine (Gly; G), histidine (His; H), isoleucine (lie; I), leucine (Leu; L), lysine (lys; K), 
methionine (Met; M), phenylalanine (Phe; F), proline (Pro; P), serine (Ser; S). threonine 
(Thr; T), tryptophan (Trp; W), tyrosine (Tyr; Y), and valine (Val; V). Furthermore (Xaa; X) 
represents any amino acid. 

I. Plant Protox Coding Sequences 

In one aspect, the inventors of the present invention have determined for the first 
time the NH2 terminus of the mature maize protoporphyrin oxidase. To achieve this result, 
maize etioplasts were isolated and the protoporphyrin oxidase was immunoprecipitated from 
the etioplast extract using anti-protoporphyrin oxidase antibodies covalently bound to 
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Protein A Sepharose resin. The NH2 temninus of the mature maize protoporphyrin oxidase 
was determined by microsequencing and found to be located at the alanine at position 36 in 
the amino acid sequence set forth in SEQ ID NO:47. This is an unexpected and surprising 
result since using the software program ChloroP (Nielsen, H., Engelbrecht, J., Bnjnak, S.. & 
von Heijne, G. (1997) Prot. Engineer. 10, 1-6), the NH2 temriinus of the mature protein was 
predicted at the amino acid corresponding to the cystein at position 35 in the amino acid 
sequence set forth in SEQ ID NO:47. The present invention therefore discloses a novel and 
non-obvious DNA molecule comprising the mature coding sequence of the maize 
protoporphyrin oxidase (protox). The nucleotide sequence of the coding sequence of the 
mature maize protoporphyrin oxidase (protox) is provided in SEQ ID NO:44 and the 
corresponding amino acid sequence is set forth in SEQ ID NO:45. 

In another aspect, the present invention is directed to an isolated DNA molecule that 
encodes protoporphyrinogen oxidase (referred to herein as "protox''), the enzyme that 
catalyzes the oxidation of protoporphyrinogen IX to protoporphyrin IX, from wheat, soybean, 
cotton, sugar beet, oilseed rape, rice, sorghum, and sugar cane. The partial DNA coding 
sequence and corresponding amino acid sequence for a wheat protox enzyme are provided 
as SEQ ID NOs:9 and 10, respectively. The DNA coding sequence and corresponding 
amino acid sequence for a soybean protox enzyme are provided as SEQ ID N6s:11 and 
12, respectively. The DNA coding sequence and corresponding amino acid sequence for a 
cotton protox enzyme are provided as SEQ ID NOs:15 and 16, respectively. The DNA 
coding sequence and corresponding amino acid sequence for a sugar beet protox enzyme 
are provided as SEQ ID NOs:17 and 18, respectively. The DNA coding sequence and 
corresponding amino acid sequence for an oilseed rape protox, enzyme are provided as 
SEQ ID N0s:19 and 20, respectively. The partial DNA coding sequence and corresponding 
amino acid sequence for a rice protox enzyme are provided as SEQ ID N0s:21 and 22, 
respectively. The partial DNA coding sequence and corresponding amino acid sequence 
for a sorghum protox enzyme are provided as SEQ ID NOs:23 and 24, respectively. The 
partial DNA coding sequence and corresponding amino acid sequence for a sugar cane 
protox enzyme are provided as SEQ ID NOs:36 and 37, respectively. 

The DNA coding sequences and corresponding amino acid sequences for protox 
enzymes from Arabidopsis thaliana and maize are provided herein as SEQ ID N0s:1-4 
(y4rai?/dops/s) and SEQ ID NOs:5-8 (maize). 

The invention therefore is directed to a DNA molecule encoding a 
protoporphyrinogen oxidase (protox) comprising a eukaryotic protox selected from the 
group consisting of a wheat protox enzyme, a soybean protox enzyme, a cotton protox 
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enzyme, a sugar beet protox enzyme, an oilseed rape protox enzyme, a rice protox 
enzyme, a sorghum protox enzyme, and a sugar cane protox enzyme. 

Preferred within the scope of the invention are isolated DNA molecules encoding the 
protoporphyrinogen oxidase (protox) enzyme from dicotyledonous plants, but especially 
from soybean plants, cotton plants, sugar beet plants and oilseed rape plants, such as 
those given in SEQ ID NOS: 11. 15, 17 and 19. More preferred are isolated DNA molecules 
encoding the protoporphyrinogen oxidase (protox) enzyme from soybean, such as given in 
SEQ ID N0:1 1 , and sugar beet, such as given in SEQ ID N0:17. 

Also preferred are isolated DNA molecules encoding the protoporphyrinogen 
oxidase (protox) enzyme from monocotyledonous plants, but especially from wheat plants, 
rice plants, sorghum plants, and sugar cane plants, such as those given in SEQ ID NOS: 9, 
21 , 23, and 36. More preferred are isolated DNA molecules encoding the 
protoporphyrinogen oxidase (protox) enzyme from wheat such as given in SEQ ID N0:9. 

In another aspect, the present invention is directed to isolated DNA molecules 
encoding the protoporphyrinogen oxidase (protox) enzyme protein from a dicotyledonous 
plant, wherein the protein comprises the amino acid sequence selected from the group 
consisting of SEQ ID NOs: 12, 16, 18 and 20. Further comprised are isolated DNA 
molecules encoding the protoporphyrinogen oxidase (protox) enzyme protein from a 
monocotyledonous plant, wherein the protein comprises the amino acid sequence selected 
from the group consisting of SEQ ID NOs: 10, 22, 24, and 37. More prefen-ed is an isolated 
DNA molecule encoding the protoporphyrinogen oxidase (protox) enzyme wherein the 
protein comprises the amino acid sequence from wheat such as given in SEQ ID NO:10. 
More prefen-ed is an isolated DNA molecule encoding the protoporphyrinogen oxidase 
(protox) enzyme wherein the protein comprises the amino acid sequence from soybean, 
such as given in SEQ ID NO:12 and sugar beet, such as given in SEQ ID N0:18. 

Using the information provided by the present invention, the DNA coding sequence 
for the protoporphyrinogen oxidase (protox) enzyme from any eukaryotic organism may be 

obtained using standard methods. 

In another aspect, the present invention is directed to an isolated DNA molecule that 
encodes a wheat protox enzyme and that comprises a nucleotide sequence that hybridizes 
to the coding sequence shown in SEQ ID N0:9 under the following hybridization and wash 
conditions: 

(a) hybridization in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04 pH 7.0, 1 
mM EDTA at 50° C; and 

(b) wash in 2X SSC, 1 % SDS at 50° C. 
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In yet another aspect, the present invention is directed to an isolated DNA molecule 
that encodes a soybean protox enzyme and that comprises a nucleotide sequence that 
hybridizes to the coding sequence shown in SEQ ID NO:1 1 under the following 
hybridization and wash conditions: 

(a) hybridization in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04 pH 7.0, 1 
mM EDTA at 50° C; and 

(b) wash in 2X SSC, 1 % SDS at 50° C. 

In still another aspect, the present invention is directed to an isolated DNA molecule 
that encodes a cotton protox enzyme and that comprises a nucleotide sequence that 
hybridizes to the coding sequence shown in SEQ ID NO:15 under the following 
hybridization and wash conditions: 

(a) hybridization in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04 pH 7,0, 1 
mM EDTA at 50° C; and 

(b) wash in 2X SSC, 1% SDS at 50° C. 

In another aspect, the present invention is directed to an isolated DNA molecule that 
encodes a sugar beet protox enzyme and that comprises a nucleotide sequence that 
hybridizes to the coding sequence shown in SEQ ID NO:17 under the following 
hybridization and wash conditions: 

(a) hybridization in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04 pH 7.0, 1 
mM EDTA at 50° C; and 

(b) wash in 2X SSC, 1 % SDS at 50° C. 

In another aspect, the present invention is directed to an isolated DNA molecule that 
encodes an oilseed rape protox enzyme and that comprises a nucleotide sequence that 
hybridizes to the coding sequence shown in SEQ ID NO:19 under the following 
hybridization and wash conditions: 

(a) hybridization In 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04 pH 7.0, 1 
mM EDTA at 50° C; and 

(b) wash in 2X SSC, 1 % SDS at 50° C. 

In another aspect, the present invention is directed to an isolated DNA molecule that 
encodes a rice protox enzyme and that comprises a nucleotide sequence that hybridizes to 
the coding sequence shown in SEQ ID N0:21 under the following hybridization and wash 
conditions: 

(a) hybridization in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04 pH 7.0, 1 
mM EDTA at 50° C; and 

(b) wash in 2X SSC, 1 % SDS at 50° C. 
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In another aspect, the present invention is directed to an Isolated DNA molecule that 
encodes a sorghum protox enzyme and that comprises a nucleotide sequence that 
hybridizes to the coding sequence shown in SEQ ID NO:23 under the following 

hybridization and wash conditions: 

(a) hybridization in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04 pH 7.0, 1 mM 

EDTA at 50° C; and 

(b) wash in 2X SSC, 1 % SDS at 50° C. 

In another aspect, the present invention is directed to an isolated DNA molecule that 
encodes a sugar cane protox enzyme and that comprises a nucleotide sequence that 
hybridizes to the coding sequence shown in SEQ ID NO:36 under the following ' 

hybridization and wash conditions: 

(a) hybridization in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04 pH 7.0, 1 

mM EDTA at 50° C; and 

(b) wash in 2X SSC, 1 % SDS at 50° C. 

The.isolated eukaryotic protox sequences taught by the present invention may be 
manipulated according to standard genetic engineering techniques to suit any desired 
purpose. For example, the entire protox sequence or portions thereof may be used as 
probes capable of specifically hybridizing to protox coding sequences and messenger 
RNA's. To achieve specific hybridization under a variety of conditions, such probes include 
sequences that are unique among protox coding sequences and are preferably at least 10 
nucleotides in length, and most preferably at least 20 nucleotides in length. Such probes 
may be used to amplify and analyze protox coding sequences from a chosen organism via 
the well known process of polymerase chain reaction (PGR). This technique may be useful 
to isolate additional protox coding sequences from a desired organism or as a diagnostic 
assay to detemnine the presence of protox coding sequences in an organism. 

Factors that affect the stability of hybrids determine the stringency of the 
hybridization. One such factor is the melting temperature T„, which can be easily calculated 
according to the formula provided in DNA PROBES, George H. Keller and Mark M. Manak , 
Macmillan Publishers Ltd, 1993, Section one: Molecular Hybridization Technology; page 8 
ff. 

The preferred hybridization temperature is in the range of about 25°0 below the calculated 
melting temperature T^ and preferably in the range of about 12-15''G below the calculated 
melting temperature Tm and in the case of oligonucleotides in the range of about 5-1 0°C 
below the melting temperature Tm. 
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Comprised by the present invention are DNA molecules that hybridize to a DNA 
molecule according to the invention as defined hereinbefore, but preferably to an 
oligonucleotide probe obtainable from the DNA molecule comprising a contiguous portion of 
the sequence of the protoporphyrinogen oxidase (protox) enzyme at least 10 nucleotides in 
length, under moderately stringent conditions. 

The invention further embodies the use of a nucleotide probe capable of specifically 
hybridizing to a plant protox gene or mRNA of at least 10 nucleotides length in a 
polymerase chain reaction (PGR). 

In a further embodiment, the present invention provides probes capable of 
specifically hybridizing to a eukaryotic DNA sequence encoding a protoporphyrinogen 
oxidase activity or to the respective mRNA and methods for detecting the DNA sequences 
in eukaryotic organisms using the probes according to the invention. 

Protox specific hybridization probes may also be used to map the location of the 
native eukaryotic protox gene(s) in the genome of a chosen organism using standard 
techniques based on the selective hybridization of the probe to genomic protox sequences. 
These techniques include, but are not limited to. Identification of DNA polymorphisms 
identified or contained within the protox probe sequence, and use of such polymorphisms to 
follow segregation of the protox gene relative to other markers of known map position in a 
mapping population derived from self fertilization of a hybrid of two polymorphic parental 
lines (see e.g. Heientjaris ef a/., Plant MoL S/o/. 5; 109 (1985). Sommer etaL Biotechniques 
12: 82 (1 992); D'Ovidio et aL, Plant MoL BloL 15: 1 69 (1 990)). While any eukaryotic protox 
sequence is contemplated to be useful as a probe for mapping protox genes from any 
eukaryotic organism, preferred probes are those protox sequences from organisms more 
closely related to the chosen organism, and most preferred probes are those protox^ 
sequences from the chosen organism. Mapping of protox genes in this manner is 
contemplated to be particularly useful in plants for breeding purposes. For instance, by 
knowing the genetic map position of a mutant protox gene that confers herbicide resistance, 
flanking DNA markers can be identified from a reference genetic map (see, e.g., Heientjaris, 
Trends Genet 3: 21 7 (1 987)). During introgression of the herbicide resistance trait into a 
new breeding line, these markers can then be used to monitor the extent of protox-linked 
flanking chromosomal DNA still present in the recurrent parent after each round of back- 
crossing. 

Protox specific hybridization probes may also be used to quantitate levels of protox 
mRNA in an organism using standard techniques such as Northern blot analysis. This 
technique may be useful as a diagnostic assay to detect altered levels of protox expression 
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that may be associated with particular adverse conditions such as autosomal dominant 
disorder in humans characterized by both neuropsychiatric symptoms and skin lesions, 
which are associated with decreased levels of protox activity (Brenner and Bloomer, New 

Engl J. Med. 302: 765 (1 980)). 

A further embodiment of the invention is a method of producing a DNA molecule 
comprising a DNA portion encoding a protein having protoporphyrinogen oxidase (protox) 

enzyme activity comprising: 

(a) preparing a nucleotide probe capable of specifically hybridizing to a plant 
protox gene or mRNA. wherein the probe comprises a contiguous portion of 
the coding sequence for a protox protein from a plant of at least 1 0 
nucleotides length; 

(b) probing for other protox coding sequences in populations of cloned genomic 
DNA fragments or cDNA fragments from a chosen organism using the 
nucleotide probe prepared according to step (a); and 

(c) isolating and multiplying a DNA molecule comprising a DNA portion encoding 
a protein having protoporphyrinogen oxidase (protox) enzyme activity. 

A further embodiment of the invention is a method of isolating a DNA molecule from 
any plant comprising a DNA portion encoding a protein having protoporphyrinogen oxidase 

(protox) enzyme activity. 

(a) preparing a nucleotide probe capable of specifically hybridizing to a plant 
protox gene or mRNA, wherein the probe comprises a contiguous portion of 
the coding sequence for a protox protein from a plant of at least 1 0 
nucleotides length; 

(b) probing for other protox coding sequences in populations of cloned genomic 
DNA fragments or cDNA fragments from a chosen organism using the 
nucleotide probe prepared according to step (a); and 

(c) isolating a DNA molecule comprising a DNA portion encoding a protein 
having protoporphyrinogen oxidase (protox) enzyme activity. 

The invention further comprises a method of producing an essentially pure DNA 
sequence coding for a protein exhibiting protoporphyrinogen oxidase (protox) enzyme 

activity, which method comprises: 

(a) preparing a genomic or a cDNA library from a suitable source organism using 

an appropriate cloning vector; 

(b) hybridizing the library with a probe molecule; and 
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(c) identifying positive hybridizations of the probe to the DNA clones from the 
library that is clones potentially containing the nucleotide sequence 
corresponding to the amino acid sequence for protoporphyrinogen oxidase 
(protox). 

The invention further comprises a method of producing an essentially pure DNA 
sequence coding for a protein exhibiting protoporphyrinogen oxidase (protox) enzyme 
activity, which method comprises; 

(a) preparing total DNA from a genomic or a cDNA library; 

(b) using the DNA of step (a) as a template for PGR reaction with primers 
representing low degeneracy portions of the amino acid sequence of 
protoporphyrinogen oxidase (protox). 

A further object of the invention is an assay to identify inhibitors of 
protoporphyrinogen oxidase (protox) enzyme activity that comprises: 

(a) incubating a first sample of protoporphyrinogen oxidase (protox) and its 
substrate; 

(b) measuring an uninhibited reactivity of the protoporphyrinogen oxidase 
(protox) from step (a); 

(c) incubating a first sample of protoporphyrinogen oxidase (protox) and its 
substrate in the presence of a second sample comprising an inhibitor 
compound; 

,<> 

(d) measuring an inhibited reactivity of the protoporphyrinogen oxidase (protox) 
enzyme from step (c); and 

(e) comparing the inhibited Reactivity to the uninhibited reactivity of 
protoporphyrinogen oxidase (protox) enzyme. 

A further object of the invention is an assay to identify inhibitor-resistant 
protoporphyrinogen oxidase (protox) mutants that comprises: 

(a) incubating a first sample of protoporphyrinogen oxidase (protox) enzyme and 
its substrate in the presence of a second sample comprising a 
protoporphyrinogen oxidase (protox) enzyme inhibitor; 

(b) measuring an unmutated reactivity of the protoporphyrinogen oxidase 
(protox) enzyme from step (a); 

(c) incubating a first sample of a mutated protoporphyrinogen oxidase (protox) 
enzyme and its substrate in the presence of a second sample comprising 
protoporphyrinogen oxidase (protox) enzyme inhibitor; 
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(d) measuring a mutated reactivity of the mutated protoporphyrinogen oxidase 
(protox) enzyme from step (c); and 

(e) comparing the mutated reactivity to the unmutated reactivity of the 
protoporphyrinogen oxidase (protox) enzyme. 

A further object of the invention is a protox enzyme inhibitor obtained by a method 

according to the invention. 

For recombinant production of the enzyme in a host organism, the protox coding 
sequence may be inserted into an expression cassette designed for the chosen host and 
introduced into the host where it is recombinantly produced. -The choice of specific 
regulatonr sequences such as promoter, signal sequence. 5' and 3' untranslated 
sequences, and enhancer, is within the level of skill of the routineer In the art. The resultant 
molecule, containing the individual elements linked in proper reading frame, may be 
inserted into a vector capable of being transformed into the host cell. Suitable expression 
vectors and methods for recombinant production of proteins are well known for host 
organisms such as E. co//(see. e.g. Studierand Moffatt, J. Mol. Biol. 789; 113 (1986); 
Brosius. DNA 8: 759 (1 989)), yeast (see. e.g., Schneider and Guarente. Meth. Enzymol. 
194: 373 (1991)) and insect cells (see. e.g., Luckow and Summers. Bio/Technol. 6:47 
(1988)). Specific examples include plasmids such as pBluescript (Stratagene, La Jolla. 
CA). pFLAG (International Biotechnologies. Inc.. New Haven. CT). pTrcHis (Invitrogen. La 
Jolla. CA). and baculovirus expression vectors, e.g.. those derived from the genome of 
Autographica californica nuclear polyhedrosis virus (AcMNPV). A preferred 
baculovirus/insect system is pVl1 1 392/Sf 21 cells (Invitrogen. La Jolla. CA). 

In a preferred embodiment, a coding sequence encoding a mature protox enzyme is 
inserted into an appropriate expression cassette and the mature protox enzyme is 
recombinantly produced in a desired host cell. Alternatively, a coding sequence encoding a 
protox polypeptide precursor comprising an entire plastid transit peptide or a portion of the 
plastid transit peptide is inserted into an appropriate expression cassette, and the protox 
polypeptide precursor is recombinantly produced in the host cell. The transit peptide is then 
cleaved to release the mature protein. The cleavage is for example carried out by in-vitro 
digestion of the polypeptide precursor, for example using a plastid transit peptide peptidase. 

Recombinantly produced eukaryotic protox enzyme is useful for a variety of 
purposes. For example, it may be used to supply protox enzymatic activity in vitro. It may 
also be used in an in vitro assay to screen known herbicidal chemicals whose target has not 
been identified to determine if they inhibit protox. Such an in vitro assay may also be used 
as a more general screen to identify chemicals that inhibit protox activity and that are 
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therefore herbicide candidates. Recombinantly produced eukaryotic protox enzyme may 
also be used in an assay to identify inhibitor-resistant protox mutants {see International 
application no. PCT/IB95/00452 filed June 8, 1995, published Dec. 21. 1995 as WO 
95/34659. incorporated by reference herein in its entirety). Alternatively, recombinantly 
produced protox enzyme may be used to further characterize its association with known 
inhibitors in order to rationally design new inhibitory herbicides as well as herbicide tolerant 
fomns of the enzyme. 

II. Inhibitor Resistant Plant Protox Enzymes 

In another aspect, the present invention teaches modifications that can be made to 
the amino acid sequence of any eukaryotic protoporphyrinogen oxidase (referred to herein 
as "protox") enzyme to yield an inhibitor-resistant fomn of this enzyme. Preferably, the 
eukaryotic protox enzyme is a plant protox enzyme. The present invention is directed to 
inhibitor-resistant protox enzymes having the modifications taught herein, to DNA molecules 
encoding these modified enzymes, and to chimeric genes capable of expressing these 
modified enzymes in plants. 

The present invention is thus directed to an isolated DNA molecule encoding a 
modified eukaryotic protoporphyrinogen oxidase (protox) having at least one amino acid 
modification, wherein the amino acid modification has the property of conferring resistance 
to a protox inhibitor, that is wherein the modified protox is tolerant to an inhibitor in amounts 
that inhibit the naturally occurring eukaryotic protox. As used herein "inhibif refers to a 
reduction in enzymatic activity observed in the presence of a subject compound compared 
' to the level of activity observed in the absence of the subject compound, wherein the 
-percent level of reduction is preferably at least 10%, more preferably at least 50%, and 
mpst preferably at least 90%. 

Preferred is a DNA molecule encoding a modified eukaryotic protoporphyrinogen 
oxidase (protox) that is a plant protox, wherein the modified protox is tolerant to a herbicide 
in amounts that inhibit the naturally occurring protox activity. Even more preferred is a 
protox selected from the group consisting of an Arabidopsis protox enzyme, a maize protox 
enzyme, a wheat protox enzyme, a soybean protox enzyme, a cotton protox enzyme, a 
sugar beet protox enzyme, an oilseed rape protox enzyme, a rice protox enzyme, a 
sorghum protox enzyme, and a sugar cane protox enzyme having at least one amino acid 
modification, wherein the modified protox is tolerant to a herbicide in amounts that inhibit 
the naturally occurring protox activity. 
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As used herein, the expression "substantially conserved amino acid sequences- 
refers to regions of amino acid homology between polypeptides comprising protox enzymes 
from different sources. In the present invention, seventeen substantially conserved ammo 
acid sub-sequences, designated 1-19 respectively, are shown in Table 1B. One sk,lled .n 
the art could align the amino acid sequences of protox enzymes from different sources, as 
has been done in Table 1 A, to identify the sub-sequences therein that make up the 
substantially conserved amino acid sequences defined herein. Stated another way. a given 
sub-sequence from one source "corresponds to" a homologous subsequence from a 
different source. The skilled person could then detemiine whether the identified sub- 
sequences have the characteristics disclosed and claimed in the present application. 

Therefore, a preferred embodiment of the present invention is directed to a nucleic 
acid molecule comprising a nucleotide sequence isolated from a plant that encodes an 
enzyme having protoporphyrinogen oxidase (protox) activity, wherein the nucleic acid 
molecule is capable of being incorporated into a nucleic acid construct used to transfomi a 
plant containing wild-type, herbicide-sensitive protox. wherein the nucleotide sequence has 
at least one point mutation relative to a wild-type nucleotide sequence encoding plant 
protox. such that upon transfomiation with the nucleic acid construct the plant is rendered 
herbicide-tolerant. 

More particularly, a preferred embodiment of the present invention is directed to a 
nucleic acid molecule comprising a nucleotide sequence isolated from a plant that encodes 
a modified enzyme having protoporphyrinogen oxidase (protox) activity, wherein the 
modified enzyme is resistant to an inhibitor of a naturally occurring protox enzyme, wherein 
the modified enzyme comprises at least one of the following amino acid sub-sequences: 
(a) APAiF, wherein Ai is an amino acid other than arginine; 

FAaS, wherein Az is an amino acid other than cysteine; 
(c) YA3G, wherein A3 is an amino acid other than alanine; 

AA4D, wherein A4 is an amino acid other than glycine; 



(b) 



(d) 



(e) YA5P, wherein A5 is an amino acid other than proline; 

PAsA, wherein Ae is an amino acid other than valine; 
A7IG, wherein A7 is an amino acid other than tyrosine; 
YlGQAs, wherein Ag is an amino acid other than alanine or serine; 
(i) AA9P, wherein A9 is an amino acid other than isoleucine; and 

GA10A, wherein A10 is an amino acid other than valine; 
KAAiaF, wherein Ais is an amino acid other than alanine; 



(f) 

(g) 

(h) 



0) 
(k) 
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(I) QAigH, wherein A19 is an amino acid other than leucine. 

(Table 1 B; sub-sequences 1 -1 0 and 1 8-1 9). 

Prefeaed is a nucleic acid molecule comprising a nucleotide sequence isolated from 
a plant that encodes a modified enzyme having protoporphyrinogen oxidase (protox) 
activity, wherein the modified enzyme is resistant to an inhibitor of a naturally occurring 
protox enzyme, wherein the modified enzyme comprises the amino acid sub-sequence APA 

iF, wherein Ai is an amino acid other than arginine. Most preferably, Ai is cysteine or 
leucine. 

Preferred is a nucleic acid molecule comprising a nucleotide sequence isolated from 
a plant that encodes a modified enzyme having protoporphyrinogen oxidase (protox) 
activity, wherein the modified enzyme is resistant to an inhibitor of a naturally occurring 
protox enzyme, wherein the modified enzyme comprises the amino acid sub-sequence FA 

2S, wherein A2 is an amino acid other than cysteine. Most preferably, A2 is phenylalanine, 
leucine, or lysine. 

Preferred is a nucleic acid molecule comprising a nucleotide sequence isolated from 
a plant that encodes a modified enzyme having protoporphyrinogen oxidase (protox) 
activity, wherein the modified enzyme is resistant to an inhibitor of a naturally occurring 
protox enzyme, wherein the modified enzyme comprises the amino acid sub-sequence YA 

3G, wherein A3 is an amino acid other than alanine. Most preferably, A3 is valine, threonine, 
leucine, cysteine, or Isoleucine. 

Prefen'ed is a nucleic acid molecule comprising a nucleotide sequence isolated from 
a plant that encodes a modified enzyme having protoporphyrinogen oxidase (protox) 
activity, wherein the modified enzyme is resistant to an inhibitor of a naturally occumng 
protox enzyme, wherein the modified enzyme comprises the amino acid sub-sequence AA 
4D, wherein A4 is an amino acid other than glycine. Most preferably, A4 is serine or leucine. 

Preferred is a nucleic acid molecule comprising a nucleotide sequence isolated from 
a plant that encodes a modified enzyme having protoporphyrinogen oxidase (protox) 
activity, wherein the modified enzyme is resistant to an inhibitor of a naturally occurring 
protox enzyme, wherein the modified enzyme comprises the amino acid sub-sequence As is 
an amino acid other than proline. Most preferably, A5 is serine or histidine. 

Preferred is a nucleic acid molecule comprising a nucleotide sequence isolated from 
a plant that encodes a modified enzyme having protoporphyrinogen oxidase (protox) 
activity, wherein the modified enzyme is resistant to an inhibitor of a naturally occurring 
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protox enzyme, wherein the modified enzyme comprises the amino acid sub-sequence PA 
3A wherein Ae is an amino acid other than valine. Most preferably. As is leucine. 

Preferred is a nucleic acid molecule comprising a nucleotide sequence isolated from 
a plant that encodes a modified enzyme having protoporphyrinogen oxidase (protox) 
activity, wherein the modified enzyme is resistant to an inhibitor of a naturally occurring 
protox enzyme, wherein the modified enzyme comprises the amino acid sub-sequence A 
y\G, wherein Av is an amino acid other than tyrosine. Most preferably. A7 is cysteine, 
isoleucine, leucine, threonine, methionine, valine, alanine, histidine, or arginine. 

Preferred is a nucleic acid molecule comprising a nucleotide sequence isolated from 
a plant that encodes a modified enzyme having protoporphyrinogen oxidase (protox) 
activity, wherein the modified enzyme is resistant to an inhibitor of a naturally occurring 
protox enzyme, wherein the modified enzyme comprises the amino acid sub-sequence FA 
,S, wherein As is an amino acid other than alanine or serine. Most preferably. Aa is proline. 

Preferred is a nucleic acid molecule comprising a nucleotide sequence isolated from 
a plant that encodes a modified enzyme having protoporphyrinogen oxidase (protox) 
activity, wherein the modified enzyme is resistant to an inhibitor of a naturally occurring 
protox enzyme, wherein the modified enzyme comprises the amino acid sub-sequence AA 
9P. wherein A9 is an amino acid other than isoleucine. Most preferably. A9 is threonine. 

histidine, glycine, or asparagine. 

Preferred is a nucleic acid molecule comprising a nucleotide sequence isolated from 
a plant that encodes a modified enzyme having protoporphyrinogen oxidase (protox) 
activity, wherein the modified enzyme is resistant to an inhibitor of a naturally occurring 
protox enzyme, wherein the modified enzyme comprises the amino acid sub-sequence GA 
10A. wherein A,o is an amino acid other than valine. Most preferably, A10 is alanine. 

Preferred is a nucleic acid molecule comprising a nucleotide sequence isolated from 
a plant that encodes a modified enzyme having protoporphyrinogen oxidase (protox) 
activity, wherein the modified enzyme is resistant to an inhibitor of a naturally occurring 
protox enzyme, wherein the modified enzyme comprises the amino acid sub-sequence KAA 
laF. wherein A,b is an amino acid other than alanine. Most preferably. Ais is threonine or 
valine. 

Preferred is a nucleic acid molecule comprising a nucleotide sequence isolated from 
a plant that encodes a modified enzyme having protoporphyrinogen oxidase (protox) 
activity, wherein the modified enzyme is resistant to an inhibitor of a naturally occunring 
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protox enzyme, wherein the modified enzyme comprises the amino acid sub-sequence QA 
19H, wherein A19 is an amino acid other than leucine. Most preferably, A19 is serine. 

Another preferred embodiment of the present invention is directed to a nucleic acid 
molecule comprising a nucleotide sequence isolated from a plant that encodes a modified 
enzyme having protoporphyrinogen oxidase (protox) activity, wherein the modified enzyme 
is resistant to an inhibitor of a naturally occurring protox enzyme, wherein the modified 
enzyme comprises at least one of the following amino acid sub-sequences: 



(a) APAiF, wherein Ai is an amino acid other than arginlne; 

(b) FA2S, wherein A2 is an amino acid other than cysteine; 

(c) YA3G, wherein A3 is an amino acid other than alanine; 

(d) AA4D, wherein A4 is an amino acid other than glycine; 

(e) YA5P, wherein A5 is an amino acid other than proline; 

(f) PAeA. wherein As is an amino acid other than valine; 

(g) A7IG, wherein Ay is an amino acid other than tyrosine; 

(h) YIGGAa, wherein As is an amino acid other than alanine or serine; 

(i) AA9P, wherein A9 is an amino acid other than isoleucine; and 
(j) GA10A, wherein A10 is an amino acid other than valine 

(k) KAAibF, wherein Ais is an amino acid other than alanine; 

(I) QA19H, wherein A19 is an amino acid other than leucine. 

(Table 1B; sub-sequences 1-10 and 18-19), and wherein the modified enzyme further 
comprises at least one additional amino acid sub-sequence selected from the group 
consisting of: ^ 

(m) QA11S, wherein An is an amino acid other than proline; 

(n) IGGA12, wherein A12 is an amino acid other than threonine; 

(o) SWXLA13, wherein A13 is an amino acid other than serine; 

(p) LA14Y, wherein A14 is an amino acid other than asparagine; and 

(q) GA15XGL, wherein A15 is an amino acid other than tyrosine. 



Preferred is a nucleic acid molecule comprising a nucleotide sequence isolated from 
a plant that encodes a modified enzyme having protoporphyrinogen oxidase (protox) 
activity, wherein the modified enzyme is resistant to an inhibitor of a naturally occurring 
protox enzyme, wherein the modified enzyme comprises the amino acid sub-sequence YA 
3G, wherein A3 is an amino acid other than alanine, or the amino acid sub-sequence A7IG, 
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Wherein A7 is an amino acid other than tyrosine, and wherein the modified enzyme further 
comprises at least one additional amino acid sub-sequence selected from the group 
consisting of: 

(m) QA11S, wherein An is an amino acid other than proline; 
(n) IGGA12, wherein A12 is an amino acid other than threonine; 
(o) SWXLA13. wherein A13 is an amino acid other than serine; 
(p) LA14Y, wherein Au is an amino acid other than asparagine; and 
(q) GA15XGL, wherein A15 is an amino acid other than tyrosine. 
Preferably, An is leucine, A12 is isoleucine or alanine, A13 is leucine, Am is serine, 
and Ai5 is cysteine. 

Another preferred embodiment of the present invention is directed to a nucleic acid 
molecule comprising a nucleotide sequence isolated from a plant that encodes a modified 
enzyme having protoporphyrinogen oxidase (protox) activity, wherein the modified enzyme 
is resistant to an inhibitor of a naturally occurring protox enzyme, wherein the modified 
enzyme comprises: the amino acid sub-sequence A7IG, wherein A7 is an amino acid other 
than tyrosine; the amino acid sub-sequences IGGA12, wherein A12 is an amino acid other 
than threonine; and the amino acid sub-sequence SWXLA13, wherein A13 is an amino acid 
other than serine. Most preferably, A? is isoleucine, A12 is isoleucine, and A13 is leucine. 

Yet another preferred embodiment of the present invention is directed to a nucleic 
acid molecule comprising a nucleotide sequence isolated from a plant that encodes a 
modified enzyme having protoporphyrinogen oxidase (protox) activity, wherein the modified 
enzyme is resistant to an inhibitor of a naturally occurring protox enzyme, wherein the 
nucleotide sequence is further characterized in that at least one of the following conditions 
is met: 

(a) the nucleic acid sequence has a sequence that encodes amino acid sub- 
sequence APA1F, wherein Ai is an amino acid other than arginine; 

(b) the nucleic acid sequence has a sequence that encodes amino acid sub- 
sequence FA2S, wherein A2 is an amino acid other than cysteine; 

(c) the nucleic acid sequence has a sequence that encodes amino acid sub- 
sequence YA3G, wherein A3 is an amino acid other than alanine; 

(d) the nucleic acid sequence has a sequence that encodes amino acid sub- 
sequence AA4D, wherein A4 is an amino acid other than glycine; 
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(e) the nucleic acid sequence has a sequence that encodes amino acid sub- 
sequence YA5P, wherein A5 is an amino acid other than proline; 

(f) the nucleic acid sequence has a sequence that encodes amino acid sub- 
sequence PAsA, wherein As is an amino acid other than valine; 

(g) the nucleic acid sequence has a sequence that encodes amino acid sub- 
sequence A7IG, wherein A7 is an amino acid other than tyrosine; 

(h) the nucleic acid sequence has a sequence that encodes amino acid sub- 
sequence YIGGAa, wherein As is an amino acid other than alanine or serine; 

(i) the nucleic acid sequence has a sequence that encodes amino acid sub- 
sequence AA9P, wherein A9 is an amino acid other than isoleucine; 

0) the nucleic acid sequence has a sequence that encodes amino acid sub- 
sequence GAioA, wherein A10 is an amino acid other than valine; 

(k) the nucleic acid sequence has a sequence that encodes amino acid sub- 
sequence YA3G, wherein A3 is an amino acid other than alanine, and the 
nucleic acid sequence also has a sequence that encodes one of the group 
consisting of: 

(1 ) sub-sequence QAnS, wherein An is an amino acid other than proline, 

(2) sub-sequence IGGA12, wherein A12 is an amino acid other than threonine, 

(3) sub-sequence SWXLAta, wherein A13 is an amino acid other than serine, 

(4) sub-sequence LA14Y, wherein Au is an amino acid other than asparagine, 

and 

(5) sub-sequence GA15XGL, wherein A15 is an amino acid other than tyrosine; 
(I) the nucleic acid sequence has a sequence that encodes amino acid sub- 
sequence A7IG, wherein Ay is an amino acid other than tyrosine, and the 
nucleic acid sequence also has a sequence that encodes one of the group 
consisting of: 

(1) sub-sequence QAnS, wherein An is an amino acid other than proline, 

(2) sub-sequence IGGA12, wherein A12 is an amino acid other than threonine, 

(3) sub-sequence SWXLA13, wherein A13 is an amino acid other than serine, 

(4) sub-sequence LA14Y, wherein Au is an arnino acid other than asparagine, 

and 

(5) sub-sequence GA15XGL, wherein A15 is an amino acid other than tyrosine; 
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(m) the nucleic has a sequence that encodes amino acid sub-sequence TAisG, 
wherein Ais is an amino acid other than leucine, and the nucleic acid 
sequence also has a sequence that encodes amino acid sub-sequence YVA 
17G, wherein A17 is an amino acid other than alanine, 
(n) KAAiaF, wherein A18 is an amino acid other than alanine; 
(o) QA19H, wherein A19 is an amino acid other than leucine. 
Preferably, said nucleic acid sequence has a sequence that encodes amino acid 
sub-sequence TAieQ. wherein Aie is an amino acid other than leucine, and said nucleic acid 
sequence also has a sequence that encodes amino acid sub-sequence YVA17G, wherein A 

17 is an amino acid other than alanine. 

Also prefen-ed is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the arginine occumng at the position 
corresponding to amino acid 88 of SEQ ID NO:6 is replaced with another amino acid, 
wherein the modified protox is tolerant to a herbicide in amounts that inhibit the naturally 
occurring protox activity. Particulariy prefen-ed is the DNA molecule wherein the arginine is 

replaced with a cysteine or a leucine. 

Also prefen-ed is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the cysteine occurring at the position 
corresponding to amino acid 159 of SEQ ID N0:6 is replaced with another amino acid, 
wherein the modified protox is tolerant to a herbicide in amounts that inhibit the naturally 
occurring protox activity. Particularly preferred is the DNA molecule wherein the cysteine is 
replaced with a phenylalanine or lysine, most preferred, wherein the cysteine is replaced 

with a phenylalanine. 

Also preferred is a DNA encoding a modified protoporphyrinogen oxidase (protox) 
comprising a plant protox wherein the isoleucine occurring at the position corresponding to 
amino acid 41 9 of SEQ ID NO:6 is replaced with another amino acid, wherein the modified 
protox is tolerant to a herbicide in amounts that inhibit the naturally occurring protox activity, 
Particulariy preferred is a DNA molecule, wherein the isoleucine is replaced with a 
threonine, histidine, glycine or asparagine most preferred, wherein the isoleucine is 

replaced with a threonine. 

Also preferred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the alanine occun-ing at the position 
corresponding to amino acid 164 of SEQ ID N0:6 is replaced with another amino acid, 
wherein the modified protox is tolerant to a herbicide in amounts that inhibit the naturally 
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occurring protox activity. Particularly preferred is a DNA molecule wherein the alanine is 
replaced with a threonine, leucine or valine. 

Also prefenred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the glycine occurring at the position 
corresponding to amino acid 165 of SEQ ID NO;6 is replaced with another amino acid, 
wherein the modified protox is tolerant to a herbicide in amounts that inhibit the naturally 
occurring protox activity. Particularly preferred is a DNA molecule wherein the glycine is 
replaced with a serine or leucine. 

Also preferred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the tyrosine occun-ing at the position 
con-esponding to amino acid 370 of SEQ ID NO:6 is replaced with another amino acid, 
wherein the modified protox is tolerant to a herbicide in amounts that inhibit the naturally 
occurring protox activity. Particularly preferred is a DNA molecule wherein the tyrosine is 
replaced with a isoleucine or methionine. 

Also preferred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the alanine occuning at the position 
corresponding to amino acid 175 of SEQ ID NO:6 is replaced with another amino acid, 
wherein the modified protox is tolerant to a herbicide in amounts that inhibit the naturally 
occurring protox activity. Particularly preferred is a DNA molecule wherein the alanine is 
replaced with a valine or threonine. 

Also preferred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the leucine occun-ing at the position 
- corresponding to amino acid 337 of SEQ ID NO:6 is replaced with another amino acid, 
wherein the modified protox is tolerant to a herbicide in amounts that inhibit the naturally 
occurring protox activity. Particularly preferred is a DNA molecule wherein the leucine is 
replaced with a serine. 

Also preferred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the valine occurring at the position con-esponding 
to amino acid 356 of SEQ ID N0:10 is replaced with another amino acid, wherein the 
modified protox is tolerant to a heri^icide in amounts that inhibit the naturally occurring 
protox activity. Parlicularty preferred is a DNA molecule wherein the valine is replaced with a 
leucine. 

Also preferred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the serine occurring at the position 
corresponding to amino acid 421 of SEQ ID N0:10 is replaced with another amino acid, 
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Wherein the modified protox is tolerant to a herbicide in amounts that inhibit the naturally 
occurring protox activity. Particularly preferred is a DNA molecule wherein the serine is 

replaced with a proline. 

Also preferred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the valine occurring at the position corresponding 
to amino acid 502 of SEQ ID NO:10 is replaced with another amino acid, wherein the 
modified protox is tolerant to a herbicide in amounts that inhibit the naturally occurring 
protox activity. Particularly preferred is a DNA molecule wherein the valine is replaced with a 
alanine. 

Also preferred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the alanine occurring at the position 
corresponding to amino acid 21 1 of SEQ ID NO:10 is replaced with another amino acid, 
wherein the modified protox is tolerant to a herbicide in amounts that inhibit the naturally 
occurring protox activity. Particularly preferred is a DNA molecule wherein the alanine is 
replaced with a valine or threonine. 

Also preferred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the glycine occurring at the position 
corresponding to amino acid 212 of SEQ ID NO:10 is replaced with another amino acid, 
wherein the modified protox is tolerant to a herbicide in amounts that inhibit the naturally 
occurring protox activity. Particularly preferred is a DNA molecule wherein the glycine is 

replaced with a serine. 

Also preferred is a DNA encoding a modified protoporphyrinogen oxidase (protox) 
comprising a plant protox wherein the isoleucine occurring at the position corresponding to 
amino acid 466 of SEQ ID NO:10 is replaced with another amino acid, wherein the modified 
protox is tolerant to a herbicide in amounts that inhibit the naturally occurring protox activity. 
Particulariy preferred is a DNA molecule wherein the isoleucine is replaced with a threonine. 

Also preferred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the proline occurring at the position 
corresponding to amino acid 369 of SEQ ID NO:12 is replaced with another amino acid, 
wherein the modified protox is tolerant to a herbicide in amounts that inhibit the naturally 
occurring protox activity. Particularly preferred is a DNA molecule wherein the proline is 
replaced with a serine or histidine. 

Also preferred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the alanine occurring at the position 
corresponding to amino acid 226 of SEQ ID NO:12 is replaced with another amino acid, 
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wherein the modified protox is tolerant to a herbicide in amounts that inhibit the naturally 
occurring protox activity. Particularly preferred is a DNA molecule, wherein the alanine is 
replaced with a threonine or leucine. 

Also preferred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the valine occurring at the position corresponding 
to amino acid 517 of SEQ ID NO:12 is replaced with another amino acid, wherein the 
modified protox is tolerant to a herbicide in amounts that inhibit the naturally occurring 
protox activity. Particularly preferred is a DNA molecule wherein the valine is replaced with a 
alanine. 

Also preferred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the tyrosine occurring at the position 
corresponding to amino acid 432 of SEQ ID NO:12 is replaced with another amino acid, 
wherein the modified protox is tolerant to a herbicide in amounts that inhibit the naturally 
occumng protox activity. Particulariy preferred is a DNA molecule wherein the tyrosine is 
replaced with a leucine or isoleucine. 

Also preferred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the proline occurring at the position 
conresponding to amino acid 365 of SEQ ID NO:16 is replaced with another amino acid, 
wherein the modified protox* is tolerant to a herbicide in amounts that inhibit the naturally 
occurring protox activity. Particulariy preferred is a DNA molecule wherein the proline is 
replaced with a serine. 

Also preferred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the tyrosine occurring at the position 
corresponding to amino acid 428 of SEQ ID N0:16 is replaced with another amino acid, 
wherein the modified protox is tolerant to a hertDicide in amounts that inhibit the naturally 
occumng protox activity. Particulariy prefen-ed is a DNA molecule wherein the tyrosine is 
replaced with a cysteine, histidine or arginine. 

Also preferred is a DNA encoding a modified protoporphyrinogen oxidase (protox) 
comprising a plant protox wherein the tyrosine occurring at the position corresponding to 
amino acid 449 of SEQ ID NO:18 is replaced with another amino acid, wherein the modified 
protox is tolerant to a herbicide in amounts that inhibit the naturally occurring protox activity. 
Particulariy preferred is a DNA molecule wherein the tyrosine is replaced with a cysteine, 
leucine, isoleucine, valine or methionine. 

The present invention is further directed to a DNA molecule encoding a modified 
protoporphyrinogen oxidase (protox) comprising a plant protox having a first amino acid 



32 



wo 01/68826 PCT/EPOl/02753 

substitution and a second amino acid substitution; the first amino acid substitution having 
the property of conferring resistance to a protox inhibitor; and the second amino acid 
substitution having the property of enhancing the resistance conferred by the first amino 
acid substitution. Preferred is a DNA molecule encoding a modified protoporphyrinogen 
oxidase (protox) comprising a plant protox, wherein the plant is selected from the group 
consisting of maize, wheat, soybean, cotton, sugar beet, oilseed rape, rice, sorghum, sugar 
cane, and Arabidopsis. More preferred is a DNA molecule encoding a modified 
protoporphyrinogen oxidase (protox) comprising a plant protox, wherein the plant is 
selected from the group consisting of maize, wheat, soybean, sugar beet, and Arabidopsis. 

Preferred is a DNA molecule wherein the second amino acid substitution occurs at a 
position selected from the group consisting of: 

(a) the position con-esponding to the serine at amino acid 305 of SEQ ID N0:2; 

(b) the position con-esponding to the threonine at amino acid 249 of SEQ ID N0:2; 

(c) the position corresponding to the proline at amino acid 1 18 of SEQ ID N0:2; 

(d) the position corresponding to the asparagine at amino acid 425 of SEQ ID N0:2; 

and 

(e) the position corresponding to the tyrosine at amino acid 498 of SEQ ID NO:2. 
Also prefen-ed is a DNA molecule wherein the first amino acid substitution occurs at 

a position selected from the group consisting of: 

(a) the position corresponding to the arginine at amino acid 88 of SEQ ID NO:6; 

(b) the position corresponding to the alanine at amino acid 164 of SEQ ID NO:6; 

(c) the position corresponding to the glycine at amino acid 165 of SEQ ID NO:6; 

(d) the position corresponding to the tyrosine at amino acid 370 of SEQ ID N0:6; 

(e) the position corresponding to the cysteine at amino acid 159 of SEQ ID NO:6; 

(f) the position corresponding to the isoleucine at amino acid 41 9 of SEQ ID N0:6. 

(g) the position con-esponding to the valine at amino acid 356 of SEQ ID NO:10; 

(h) the position con-esponding to the serine at amino acid 421 of SEQ ID NO:10; 

(i) the position corresponding to the valine at amino acid 502 of SEQ ID NO:10; 
(j) the position corresponding to the alanine at amino acid 21 1 of SEQ ID NO:10; 
(k) the position corresponding to the glycine at amino acid 212 of SEQ ID NO:10; 
(I) the position corresponding to the isoleucine at amino acid 466 of SEQ ID NO:10; 
(m) the position corresponding to the proline at amino acid 369 of SEQ ID NO:12; 
(n) the position corresponding to the alanine at amino acid 226 of SEQ ID NO:12; 
(o) the position corresponding to the tyrosine at amino acid 432 of SEQ ID NO:12; 
(p) the position corresponding to the valine at amino acid 517 of SEQ ID N0:12; 
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(q) the position corresponding to the tyrosine at amino acid 428 of SEQ ID NO:16; 
(r) the position con-esponding to the proline at amino acid 365 of SEQ ID NO:16; 
(s) the position conresponding to the tyrosine at amino acid 449 of SEQ ID N0:18; 
(t) the position con-esponding to the alanine at amino acid 175 of SEQ ID NO:6; 
and 

(u) the position corresponding to the leucine at amino acid 337 of SEQ ID N0:6. 
Particularly preferred is a DNA molecule encoding a modified protoporphyrinogen 
oxidase (protox) comprising a plant protox wherein the plant protox comprises an amino 
acid sequence selected from the group consisting of SEQ ID NOs: 2, 4, 6, 8. 10, 12, 16, 18, 
20, 22, and 37. Most preferred is a DNA molecule encoding a modified protoporphyrinogen 
oxidase (protox) comprising a plant protox, wherein the plant protox comprises an amino 
acid sequence selected from the group consisting of SEQ ID NOs: 2, 4, 6, 8. 10, 12, and 
18. 

More preferred is a DNA molecule, wherein the first amino acid substitution occurs at 
a position selected from the group consisting of: 

(a) the position con-esponding to the arginine at amino acid 88 of SEQ ID NO:6; 

(b) the position con-esponding to the alanine at amino acid 164 of SEQ ID NO:6; 

(c) the position corresponding to the glycine at amino acid 165 of SEQ ID N0:6; 

(d) the position corresponding to the tyrosine at amino acid 370 of SEQ ID NO:6; 

(e) the position corresponding to the cysteine at amino acid 159 of SEQ ID NO:6; 

(f) the position corresponding to the Isoleucine at amino acid 419 of SEQ ID NO:6 

(g) the position corresponding to the alanine at amino acid 175 of SEQ ID NO:6; 

and 

(h) ;the position corresponding to the leucine at amino acid 337 of SEQ ID NO:6- 
More preferred is a DNA molecule wherein the second amino acid substitution 

occurs at the position corresponding to the serine at amino acid 305 of SEQ ID NO:2 and 
the first amino acid substitution occurs at a position selected from the group consisting of: 

(a) the position corresponding to the alanine at amino acid 164 of SEQ ID NO:6; 

and 

(b) the position con-esponding to the tyrosine at amino acid 370 of SEQ ID N0:6. 
Particularly preferred is a DNA molecule wherein the serine occurring at the position 

corresponding to amino acid 305 of SEQ ID N0:2 is replaced with leucine. 

More preferred is a DNA molecule wherein the second amino acid substitution 
occurs at the position corresponding to the threonine at amino acid 249 of SEQ ID N0:2 
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and the first amino acid substitution occurs at a position selected from the group consisting 
of: 

(a) the position corresponding to the alanine at amino acid 164 of SEQ ID N0:6; 

and 

(b) the position corresponding to the tyrosine at amino acid 370 of SEQ ID N0:6. 
Particularly preferred is a DNA wherein the threonine occurring at the position 

corresponding to amino acid 249 of SEQ ID N0:2 is replaced with an amino acid selected 
from the group consisting of isoleucine and alanine. 

More preferred is a DNA molecule wherein the second amino acid substitution 
occurs at the position corresponding to the proline at amino acid 11 8 of SEQ ID NO:2 and 
the first amino acid substitution occurs at a position selected from the group consisting of: 

(a) the position corresponding to the alanine at amino acid 164 of SEQ ID NO:6; 

and 

(b) the position conresponding to the tyrosine at amino acid 370 of SEQ ID N0:6. 
Particularly preferred is a DNA molecule wherein the proline occurring at the position 

corresponding to amino acid 1 18 of SEQ ID NO:2 is replaced with a leucine. 

More preferred is a DNA molecule wherein the second amino acid substitution 
occurs at the position corresponding to the asparagine at amino acid 425 of SEQ ID N0:2 
and the first amino acid substitution occurs at a position selected from the group consisting 
of: 

(a) the position corresponding to the alanine at amino acid 164 of SEQ ID N0:6; 

and 

(b) the position corresponding to the tyrosine at amino acid 370 of SEQ ID N0:6. 
Particularly preferred is a DNA molecule wherein the asparagine occurring at the 

position corresponding to amino acid 425 of SEQ ID N0:2 is replaced with a serine. 

More preferred is a DNA molecule wherein the second amino acid substitution 
occurs the position corresponding to the tyrosine at amino acid 498 of SEQ ID NO:2 and 
the first amino acid substitution occurs at a position selected from the group consisting of: 

(a) the position corresponding to the alanine at amino acid 1 64 of SEQ ID N0:6; 

and 

(b) the position corresponding to the tyrosine at amino acid 370 of SEQ ID N0:6. 
Particularly preferred is a DNA molecule wherein the tyrosine occurring at the 

position corresponding to amino acid 498 of SEQ ID N0:2 is replaced with a cysteine. 
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More preferred is a DNA molecule wherein the tyrosine occurring at the position 
corresponding to annino acid 370 of SEQ ID NO:6 is replaced with an amino acid selected 
from the group consisting of cysteine, isoleucine, leucine, threonine, valine and methionine. 

Particularly preferred is a DNA molecule wherein the tyrosine occurring at the 
position corresponding to amino acid 370 of SEQ ID NO:6 is replaced with an amino acid 
selected from the group consisting of cysteine, isoleucine, leucine, threonine and 
methionine. 

More preferred is a DNA molecule wherein the alanine occurring at the position 
corresponding to residue 164 of SEQ ID N0:6 is replaced with an amino acid selected from 
the group consisting of valine, threonine, leucine, cysteine and tyrosine. 

More prefenred is a DNA molecule wherein the glycine occurring at the position 
corresponding to residue 165 of SEQ ID N0:6 is replaced with an amino acid selected from 
the group consisting of serine and leucine. 

Particularly preferred is a DNA molecule wherein the glycine occurring at the position 
corresponding to residue 165 of SEQ ID N0:6 is replaced with a serine. 

Particulariy preferred Is a DNA molecule wherein the arginine occurring at the 
position con-esponding to residue 88 of SEQ ID NO:6 is replaced with a cysteine or a 
leucine. 

More preferred is a DNA molecule wherein the cysteine occurring at the position 
corresponding to residue 159 of SEQ ID N0:6 is replaced with an amino acid selected from 
the group consisting of phenylalanine and lysine. 

Particularly preferred is a DNA molecule wherein the cysteine occurring at the 
position corresponding to residue 159 of SEQ ID NO;6 is replaced with a phenylalanine. 

More preferred is a DNA molecule wherein the isoleucine occurring at the position 
corresponding to residue 419 of SEQ ID N0:6 is replaced with an amino acid selected from 
the group consisting of threonine, histidine, glycine and asparagine. 

Particulariy prefen-ed is a DNA molecule wherein the isoleucine occurring at the 
position corresponding to residue 419 of SEQ ID NO:6 is replaced with a threonine. 

Particularly preferred is a DNA molecule wherein the alanine occurring at the 
position corresponding to residue 175 of SEQ ID N0:6 is replaced with an amino acid 
selected from the group consisting of threonine and valine. 

Particularly preferred is a DNA molecule wherein the leucine occurring at the 
position corresponding to residue 337 of SEQ ID N0:6 is replaced with a serine. 

More prefen-ed Is a DNA molecule wherein the valine occurring at the position 
corresponding to residue 356 of SEQ ID NO:10 is replaced with a leucine. 
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More preferred is a DNA molecule wherein the serine occurring at the position 
corresponding to residue 421 of SEQ ID NO:10 is replaced with a proline. 

More preferred is a DNA molecule wherein the valine occurring at the position 
corresponding to residue 502 of SEQ ID NO:10 is replaced with a alanine. 

More preferred is a DNA molecule wherein the isoleucine occurring at the position 
corresponding to residue 466 of SEQ ID N0:10 is replaced with a threonine. 

More preferred is a DNA molecule wherein the glycine occurring at the position 
corresponding to residue 212 of SEQ ID N0:10 is replaced with a serine. 

More preferred is a DNA molecule wherein the alanine occurring at the position 
corresponding to residue 21 1 of SEQ ID N0:10 is replaced with a valine or threonine. 

More preferred is a DNA molecule wherein the proline occurring at the position 
corresponding to residue 369 of SEQ ID NO:12 is replaced with a serine or a histidine. 

More preferred is a DNA molecule wherein the alanine occurring at the position 
corresponding to residue 226 of SEQ ID N0:12 is replaced with a leucine or threonine. 

More preferred is a DNA molecule wherein the tyrosine occurring at the position 
corresponding to residue 432 of SEQ ID N0:12 is replaced with a leucine or isoleucine. 

More preferred is a DNA molecule wherein the valine occuning at the position 
con-esponding to residue 517 of SEQ ID N0:12 is replaced with a alanine. 

More preferred is a DNA molecule wherein the tyrosine occurring at the position 
corresponding to residue 428 of SEQ ID N0:16 is replaced with cysteine, histidine or 
arginine. 

More preferred is a DNA molecule wherein the proline occurring at the position 
corresponding to residue 365 of SEQ ID N0:16 is replaced with serine. 

More prefen-ed is a DNA molecule wherein the proline occurring at the position 
corresponding to residue 449 of SEQ ID N0:18 is replaced with an amino acid selected 
from the group consisting of leucine, isoleucine, valine and methionine. 

The present invention is still further directed to a DNA molecule encoding a modified 
protoporphyrinogen oxidase (protox) comprising a plant protox having a double amino acid 
substitution, wherein both amino acid substitutions are required for there to be resistance to 
a protox inhibitor. Preferred is a DNA molecule encoding a modified protoporphyrinogen 
oxidase (protox) comprising a plant protox, wherein the plant is selected from the group 
consisting of maize, wheat, soybean, cotton, sugar beet, oilseed rape, rice, sorghum, sugar 
cane, and Arabidopsis. More preferred is a DNA molecule encoding a modified 
protoporphyrinogen oxidase (protox) comprising a plant protox, wherein the plant is maize. 
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Preferred is a DNA molecule having a double amino acid substitution, wherein one 
amino acid substitution occurs at the position corresponding to the leucine at amino acid 
347 of SEQ ID N0:6, and wherein the second amino acid substitution occurs at the position 
corresponding to the alanine at amino acid 453 of SEQ ID NO:6. 

Particularly preferred is a DNA molecule having a double amino acid substitution, 
wherein a leucine occurring at the position corresponding to amino acid 347 of SEQ ID 
NO:6 is replaced with a serine, and wherein an alanine occurring at the position 
con-esponding to amino acid 453 of SEQ ID N0:6 is replaced with a threonine. 

The present invention is directed to expression cassettes and recombinant vectors 
comprising the expression cassettes comprising essentially a promoter, but especially a 
promoter that is active in a plant, operatively linked to a DNA molecule encoding the 
protoporphyrinogen oxidase (protox) enzyme from a eukaryotic organism according to the 
invention. The expression cassette according to the invention may in addition further 
comprise a signal sequence operatively linked to the DNA molecule, wherein the signal 
sequence is capable of targeting the protein encoded by the DNA molecule into the 
chloroplast or the mitochondria. 

The invention relates to a chimeric gene, which comprises an expression cassette 
comprising essentially a promoter, but especially a promoter that is active in a plant, 
operatively linked to a heterologous DNA molecule encoding a protoporphyrinogen oxidase 
(protox) enzyme from a eukaryotic organism according to the invention. Prefen-ed is a 
chimeric gene, wherein the DNA molecule encodes an protoporphyrinogen oxidase (protox) 
enzyme from a plant selected from the group consisting oi Arabidopsis, sugar cane, 
soybean, barley, cotton, tobacco, sugar beet, oilseed rape, maize, wheat, sorghum, rye, 
^oats,'turf and forage grasses, millet, forage and rice. More preferred is a chimeric gene, 
wherein the DNA molecule encodes an protoporphyrinogen oxidase (protox) enzyme from a 
plant selected from the group consisting of soybean, cotton, tobacco, sugar beet, oilseed 
rape, maize, wheat, sorghum, rye, oats, turf grass, and rice. Particularly preferred is a 
chimeric gene, wherein the DNA molecule encodes an protoporphyrinogen oxidase (protox) 
enzyme from a plant selected from the group consisting of wheat, soybean, cotton, sugar 
beet, oilseed rape, rice and sorghum. Most preferred is a chimeric gene, wherein the DNA 
molecule encodes an protoporphyrinogen oxidase (protox) enzyme from a plant selected 
from the group consisting of soybean, sugar beet, and wheat. 

More prefen-ed is a chimeric gene comprising a promoter active in a plant operatively 
linked to a heterologous DNA molecule encoding a protoporphyrinogen oxidase (protox) 
selected from the group consisting of a wheat protox comprising the sequence set forth in 
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SEQ ID NO:10, a soybean protox comprising the sequence set forth in SEQ ID NO:12, 
cotton protox connprising the sequence set forth in SEQ ID N0:16, a sugar beet protox 
comprising the sequence set forth in SEQ ID N0:18, an oilseed rape protox comprising the 
sequence set forth in SEQ ID NO:20, a rice protox comprising the sequence set forth in 
SEQ ID NO:22, a sorghum protox comprising the sequence set forth in SEQ ID NO:24, and 
a sugar cane protox comprising the sequence set forth in SEQ ID NO:37. More preferred is 
a chimeric gene, wherein the protoporphyrinogen oxidase (protox) is selected from the 
group consisting of a wheat protox comprising the sequence set forth in SEQ ID NO: 10, a 
soybean protox comprising the sequence set forth in SEQ ID N0:12, and a sugar beet 
protox comprising the sequence set forth in SEQ ID N0:18. 

Particularty preferred is a chimeric gene, wherein the DNA molecule encodes a 
protein from an Arabidopsis species having protox-1 activity or protox-2 activity, preferably 
wherein the protein comprises the amino acid sequence set forth in SEQ ID N0:2 or SEQ 
ID N0:4. 

Particularly preferred is a chimeric gene, wherein the DNA molecule encodes a 
protein from maize having protox-1 activity or protox-2 activity, preferably wherein the 
protein comprises the amino acid sequence set forth in set forth In SEQ ID N0:6 or SEQ ID 
NO:8. 

Particularly preferred is a chimeric gene, wherein the DNA molecule encodes a 
protein from wheat having protox-1 activity preferably wherein the protein comprises the 
amino acid sequence set forth in SEQ ID NO:10. 

' Particularly preferred is a chimeric gene, wherein the DNA molecule encodes a 
protein from soybean having protox-1 activity, preferably wherein the protein comprises the 
amino acid sequence set forth in SEQ ID NO:12. 

Particularly preferred is a chimeric gene, wherein the DNA molecule encodes a 
protein from cotton having protox-1 activity, preferably wherein the protein comprises the 
amino acid sequence set forth in SEQ ID N0:16. 

Particularly preferred is a chimeric gene, wherein the DNA molecule encodes a 
protein from sugar beet having protox-1 activity, preferably wherein the protein comprises 
the amino acid sequence set forth in SEQ ID N0:18. 

Particularly preferred is a chimeric gene, wherein the DNA molecule encodes a 
protein from oilseed rape having protox-1 activity, preferably wherein the protein comprises 
the amino acid sequence set forth in SEQ ID NO:20. 
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Particularly preferred is a chimeric gene, wherein the DNA molecule encodes a 
protein from rice having protox-1 activity, preferably wherein the protein comprises the 
amino acid sequence set forth in SEQ ID NO:22. 

Particularly preferred is a chimeric gene, wherein the DNA molecule encodes a 
protein from sorghum having protox-1 activity, preferably wherein the protein comprises the 
amino acid sequence set forth in SEQ ID NO:24. 

Particularly preferred is a chimeric gene, wherein the DNA molecule encodes a 
protein from sugar cane having protox-1 activity, preferably wherein the protein comprises 
the amino acid sequence set forth in SEQ ID NO:37. 

The invention also embodies a chimeric gene, which comprises an expression 
cassette comprising essentially a promoter, but especially a promoter that is active in a 
plant, operatively linked to the DNA molecule encoding an protoporphyrinogen oxidase 
(protox) enzyme from a eukaryotic organism according to the invention, which is resistant to 
herbicides at levels that inhibit the corresponding unmodified version of the enzyme. 
Preferred is a chimeric gene, wherein the DNA molecule encodes an protoporphyrinogen 
oxidase (protox) enzyme from a plant selected from the group consisting of Arabidopsis, 
sugar cane, soybean, bariey, cotton, tobacco, sugar beet, oilseed rape; maize, wheat, 
sorghum, rye, oats, turf and forage grasses, millet, forage and rice. More preferred is a 
chimeric gene, wherein the DNA molecule encodes an protoporphyrinogen oxidase (protox) 
enzyme from a plant selected from the group consisting of soybean, cotton, tobacco, sugar 
beet, oilseed rape, maize, wheat, sorghum, rye, oats, turf grass, and rice. Particulariy 
prefprred is a chimeric gene, wherein the DNA molecule encodes an protoporphyrinogen 
oxidase (protox) enzyme from a plant selected from the group consisting of Arabidopsis, 
soybean, cotton, . sugar beet, oilseed rape, maize, wheat, sorghum; and rice. 

Encompassed by the. present invention is a chimeric gene comprising a promoter 
that is active in a plant operatively linked to the DNA molecule encoding a modified 
protoporphyrinogen oxidase (protox) comprising a eukaryotic protox having at least one 
amino acid modification, wherein the amino acid modification has the property of conferring 
resistance to a protox inhibitor. 

Also encompassed by the present invention is a chimeric gene comprising a 
promoter that is active in a plant operatively linked to the DNA molecule encoding a 
modified protoporphyrinogen oxidase (protox) comprising a plant protox having a first amino 
acid substitution and a second amino acid substitution; the first amino acid substitution 
having the property of conferring resistance to a protox inhibitor; and the second amino acid 
substitution having the property of enhancing the resistance conferred by the first amino 
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acid substitution. Preferred is tfie chimeric gene additionally comprising a signal sequence 
operatively linked to the DNA molecule, wherein the signal sequence is capable of targeting 
the protein encoded by the DNA molecule into the chloroplast or in the mitochondria. 

The chimeric gene according to the invention may in addition further comprise a 
signal sequence operatively linked to the DNA molecule, wherein the signal sequence is 
capable of targeting the protein encoded by the DNA molecule into the chloroplast The 
chimeric gene according to the invention may in addition further comprise a signal 
sequence operatively linked to the DNA molecule, wherein the signal sequence Is capable 
of targeting the protein encoded by the DNA molecule into the mitochondria. 

Also encompassed by the present invention is any of the DNA sequences 
mentioned herein before, which is stably integrated into a host genome. 

The invention further relates to a recombinant DNA molecule comprising a plant 
protoporphyrinogen oxidase (protox) or a functionally equivalent derivative thereof. 

The invention further relates to a recombinant DNA vector comprising the 
recombinant DNA molecule of the invention. 

A further object of the invention is a recombinant vector comprising the chimeric 
gene according to the invention, wherein the vector is capable of being stably transfonned 
into a host cell. 

A further object of the invention is a recombinant vector comprising the chimeric 
gene according to the invention, wherein the vector is capable of being stably transformed 
Into a plant, plant seeds, plant tissue or plant cell. Preferred is a recombinant vector 
comprising the chimeric gene according to the invention, wherein the vector is capable of 
being stably transfomied into a plant. The plant, plant seeds, plant tissue or plant cell stably 
transfomned with the vector is capable of expressing the DNA molecule encoding a 
protoporphyrinogen oxidase (protox). Preferred is a recombinant vector, wherein the plant, 
plant seeds, plant tissue or plant cell stably transfomied with the the vector is capable of 
expressing the DNA molecule encoding a protoporphyrinogen oxidase (protox) from a plant 
that is resistant to herbicides at levels that inhibit the corresponding unmodified version of 
the enzyme. 

Preferred is a recombinant vector comprising the chimeric gene comprising a 
promoter active in a plant operatively linked to a heterologous DNA molecule encoding a 
protoporphyrinogen oxidase (protox) selected from the group consisting of a wheat protox 
comprising the sequence set forth in SEQ ID NO:10, a soybean protox comprising the 
sequence set forth in SEQ ID NO:12, cotton protox comprising the sequence set forth in 
SEQ ID N0:1 6, a sugar beet protox comprising the sequence set forth in SEQ ID N0:18, an 
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Oilseed rape protox comprising the sequence set forth in SEQ ID NO:20, a rice protox 
comprising the sequence set forth in SEQ ID NO:22, a sorghum protox comprising the 
sequence set forth in SEQ ID NO:24, and a sugar cane protox comprising the sequence set 
forth in SEQ ID NO:37, wherein the vector is capable of being stably transformed into a host 
cell. 

Also preferred is recombinant vector comprising the chimeric gene comprising a 
promoter that is active in a plant operatively linked to the DNA molecule encoding a 
modified protoporphyrinogen oxidase (protox) comprising a plant protox having a first amino 
acid substitution and a second amino acid substitution; the first amino acid substitution 
having the property of confening resistance to a protox inhibitor; and the second amino acid 
substitution having the property of enhancing the resistance conferred by the first amino 
acid substitution, wherein the vector is capable of being stably transformed into a plant cell. 

Also encompassed by the present invention is a host cell stably transformed with the 
vector according to the invention, wherein the host cell is capable of expressing the DNA 
molecule. Preferred iS:a host ceil selected from the group consisting of a plant cell, a 
bacterial cell, a yeast cell, and an insect celL 

The present invention Is further directed to plants and the progeny thereof, plant 
tissue and plant seeds tolerant to herbicides that inhibit the naturally occurring protox 
activity in these plants, wherein the tolerance is conferred by a gene expressing a modified 
inhibitor-resistant protox enzyme as taught herein. Representative plants include any plants i ■ 
to which these heriDicides may be applied for their normally intended purpose. Preferred are 
^ agronomically important crops, i.e., angiospemns and gymnosperms such as Arabidopsis, 
sugar cane, soybean, bariey, cotton, tobacco, sugar beet, oilseed rape, maize, wheat, 
sorghum, rye, oats, tomato, potato, turf and forage grasses, millet, forage, and rice and the 
like. More preferred are agronomically important crops, i.e., angiosperms and gymnosperms 
such as Arabidopsis, cotton, soybean, oilseed rape, sugar beet, maize, rice, wheat, bariey, 
oats, rye, sorghum, millet, turf, forage, turf grasses. Particularly pretended are agronomically 
important crops, i.e., angiosperms and gymnosperms such as Arabidopsis, soybean, cotton, 
sugar beet, oilseed rape, maize, wheat, sorghum, and rice. 

Preferred is a plant comprising the DNA molecule encoding a modified 
protoporphyrinogen oxidase (protox) comprising a plant protox having a first amino acid 
substitution and a second amino acid substitution; the first amino acid substitution having 
the property of conferring resistance to a protox inhibitor; and the second amino acid 
substitution having the property of enhancing the resistance conferred by the first amino 
acid substitution, wherein.the DNA molecule is expressed in the plant and confers upon the 
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plant tolerance to a herbicide in amounts that inhibit naturally occurring protox activity. 
Preferred is a plant, wherein the DNA molecule replaces a corresponding naturally 
occurring protox coding sequence. Comprised by the present invention is a plant and the 
progeny thereof comprising the chimeric gene according to the invention, wherein the 
chimeric gene confers upon the plant tolerance to a herbicide in amounts that inhibit 
naturally occurring protox activity. 

Encompassed by the present invention are transgenic plant tissue, including plants 
and the progeny thereof, seeds, and cultured tissue, stably transformed with at least one 
chimeric gene according to the invention. Preferred is transgenic plant tissue, including 
plants, seeds, and cultured tissue, stably transformed with at least one chimeric gene that 
comprises an expression cassette comprising essentially a promoter, but especially a 
promoter that is active in a plant, operatively linked to the DNA molecule encoding an 
protoporphyrinogen oxidase (protox) enzyme that is resistant to herbicides at levels that 
inhibit the corresponding unmodified version of the enzyme in the plant tissue. 

The present invention is further directed to plants, plant tissue, plant seeds, and 
plant cells tolerant to herbicides that inhibit the naturally occurring protox activity in these 
plants, wherein the tolerance is confen'ed by increasing expression of wild-type 
herbicide-sensitive protox. This results in a level of a protox enzyme in the plant cell at least 
sufficient to overcome growth inhibition caused by the heriDicide. The level of expressed 
enzyme generally is at least two times, preferably at least five times, and more preferably at 
least ten times the natively expressed amount. Increased expression may be due to 
multiple copies of a wild-type protox gene; multiple occurrences of the coding sequence 
within the gene (/.e. gene amplification) or a mutation in the non-coding, regulatory 
sequence of the endogenous gene in the plant cell. Plants having such altered gene 
activity can be obtained by direct selection in plants by methods known in the art (see, e.g. 
U.S. Patent No. 5,162,602, and U.S. Patent No. 4,761,373, and references cited therein). 
These plants also may be obtained by genetic engineering techniques known in the art. 
Increased expression of a heriDicide-sensitive protox gene can also be accomplished by 
stably transforming a plant cell with a recombinant or chimeric DNA molecule comprising a 
promoter capable of driving expression of an associated structural gene in a plant cell 
operatively linked to a homologous or heterologous structural gene encoding the protox 
enzyme. 

The recombinant DNA molecules of the invention can be introduced into the plant 
cell in a number of art-recognized ways. Those skilled in the art will appreciate that the 
choice of method might depend on the type of plant, i.e. monocot or dicot, targeted for 
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transformation. Suitable methods of transforming plant cells include microinjection 

(Crossway ef a/., BioTechniques 4.-320-334 (1986)), electroporation (Riggs etal, Proc. Natl. 

Acad. ScL USA 83:5602-5606 (1986), Agrobacterium mediated transfomr^ation (Hinchee et 

aL, Biotechnology 5:915-921 (1988)), direct gene transfer (Paszkowski et ai, EMBO J. 

3.-271 7-2722 (1984)), ballistic particle acceleration using devices available from Agracetus, 

Inc., Madison, Wisconsin and Dupont, Inc., Wilmington, Delaware {see, for example, 

Sanford ef a/., U.S. Patent 4,945,050; and McCabe etaL, Biotechnology 6:923-926 (1988)), 

protoplast transformation/regeneration methods {see U.S. Patent No. 5,350,689 issued 

Sept. 27, 1994 to Ciba-Geigy Corp.), and pollen transformation {see U.S. Patent No. 

5,629,183). Also see, Weissinger etaL, Annual Rev, Genet 22:421-477 (1988); Sanford et 

al, Particulate Science and Technology 527-37 (1987)(onion); Christou etaL, Plant 

PhysioL 87:671-674 (1 988)(soybean); McCabe etaL, Bio/Technology 6:923-926 

(1 988)(soybean); Datta etaL, Bio/Technology 8:736-740 (1990)(rice); Klein etaL, Proc, 

NatL Acad. ScL USA, 85:4305-4309 (1988)(maize); Klein etaL, Bio/Technology e,559-563 

(1988)(maize); K\emet aL, Plant PhysioL 9y:440-444 (1988)(mai2e); Fromm etaL, 

Bio/Technology 8.'833-839 (1 990); Gordon-Kamm ef aL, Plant Cell 2:603-61 8 (1 990) 

(maize); and U.S. Patent Nos. 5,591,616 and 5,679,558 (rice). 

Comprised within the scope of the present invention are transgenic plants, in 

particular transgenic fertile plants transformed by means of the aforedescribed processes 

and their asexual and/or sexual progeny, which still are resistant or at least tolerant to 

inhibition by a herbicide at levels that nonnally are inhibitory to the naturally occurring 

• • ,. ■ ' ' , ■ ■ ■ ■ 
^ protox activity in the plant. Progeny plants also include plants with a different genetic 

^•background than the parent piant, which plants result from a backcrbssing program and still 

' comprise in their genome the herbicide resistance trait according to the invention. Very 

especially prefen'ed are hybrid plants that are resistant or at least tolerant to inhibition by a 

herbicide at levels that normally are inhibitory to the naturally occurring protox activity in the 

plant. 

The transgenic plant according to the invention may be a dicotyledonous or a 
monocotyledonous plant. Preferred are monocotyledonous plants of the Gramlnaceae 
family involving Lolium, Zea, Triticum, Triticale, Sorghum, Saccharum, Bromus, Oryzae, 
Avena, Hordeum, Secale and Setaria plants. More preferred are transgenic maize, wheat, 
bariey, sorghum, rye, oats, sugar cane, turf and forage grasses, millet and rice. Especially 
preferred are maize, wheat, sorghum, rye, oats, turf grasses and rice. 
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Among the dicotyledonous plants Arabidopsis, soybean, cotton, sugar beet, oilseed 
rape, tobacco, tomato, potato, and sunflower are more preferred herein. Especially 
preferred are soybean, cotton, tobacco, sugar beet, tomato, potato, and oilseed rape. 

The expression 'progeny' is understood to embrace both, "asexually" and "sexually" 
generated progeny of transgenic plants. This definition is also meant to include all mutants 
and variants obtainable by means of known processes, such as for example cell fusion or 
mutant selection and that still exhibit the characteristic properties of the initial transformed 
plant, together with all crossing and fusion products of the transformed plant material. This 
also includes progeny plants that result from a backcrossing program, as long as the 
progeny plants still contain the herbicide resistant trait according to the invention. 

Another object of the invention concerns the proliferation material of transgenic 
plants. The proliferation material of transgenic plants is defined relative to the invention as 
any plant material that may be propagated sexually or asexually in vivo or in vitro. 
Particularly preferred within the scope of the present invention are protoplasts, cells, calli, 
tissues, organs, seeds, embryos, pollen, egg cells, zygotes, together with any other 
propagating material obtained from transgenic plants. 

Parts of plants, such as for example flowers, stems, fruits, leaves, roots originating 
in transgenic plants or their progeny previously transformed by means of the process of the 
invention and therefore consisting at least in part of transgenic ceils, are also an object of 
the present invention, 

A further object of the invention is a method of producing plants, protoplasts, cells, 
calli, tissues, organs, seeds, embryos, pollen, egg cells, zygotes, together with any other 
propagating material, parts of plants, such as for example flowers, stems, fruits, leaves, 
roots originating in transgenic plants or their progeny previously transformed by means of 
the process of the invention, which therefore produce an inhibitor resistant fomn of a plant 
protox enzyme by transforming the plant, plant parts with the DNA according to the 
invention. Preferred is a method of producing a host cell comprising an isolated DNA 
molecule encoding a protein from a eukaryote having protoporphyrinogen oxidase (protox) 
activity comprising transfonning the host cell with a recombinant vector molecule according 
to the invention. Further preferred is a method of producing a plant cell comprising an 
isolated DNA molecule encoding a protein from a eukaryote having protoporphyrinogen 
oxidase (protox) activity comprising transforming the plant cell with a recombinant vector 
molecule according to the invention. Preferred is a method of producing transgenic progeny 
of a transgenic parent plant comprising an isolated DNA molecule encoding a protein from a 
eukaryote having protoporphyrinogen oxidase (protox) activity comprising transforming the 
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parent plant with a recombinant vector molecule according to the invention and transfenring 
the herbicide tolerant trait to the progeny of the transgenic parent plant involving known 
plant breeding techniques. 

Preferred is a method for the production of plants, plant tissues, plant seeds and 
plant parts, which produce an inhibitor-resistant fomn of the plant protox enzyme, wherein 
the plants, plant tissues, plant seeds and plant parts have been stably transformed with a 
structural gene encoding the resistant protox enzyme. Particularly preferred is a method for 
the production of plants, plant tissues, plant seeds and plant parts, wherein the plants, plant 
tissues, plant seeds and plant parts have been stably transformed with the DNA according 
to the invention. Especially preferred Is a method for the production of the plants, plant 
tissues, plant seeds and plant parts, which produce an inhibitor-resistant fomri of the plant 
protox enzyme, wherein the plants, plant tissues, plant seeds and plant parts have been 
prepared by direct selection techniques whereby herbicide resistant lines are isolated, 
characterized and developed. 

The genetic properties engineered into the transgenic seeds and plants described 
above are passed on by sexual reproduction or vegetative growth and can thus be 
maintained and propagated in progeny plants. Generally the maintenance and propagation 
make use of known agricultural methods develpped to fit specific purposes such as tilling, 
sowing or harvesting. Specialized processes such as hydroponics or greenhouse 
technologies can also be applied. As the growing crop is vulnerable to attack and damages 
caused by insects or infections as well-as to competition by weed plants, measures are 
undertaken to control weeds, plant diseases, insects, nematodes, and other adverse 
conditions to improve yield. These include mechanical measures such a tillage of the soil or 
removal of weeds and infected plants, as well as the application of agrochemicals such as 
herbicides, fungicides, gametocides, nematicides, growth regulants, ripening agents and 
insecticides. 

Use of the advantageous genetic properties of the transgenic plants and seeds 
according to the invention can further be made in plant breeding that aims at the 
development of plants with improved properties such as tolerance of pests, herbicide 
tolerance, or stress tolerance, improved nutritional value, increased yield, or improved 
structure causing less loss from lodging or shattering. The various breeding steps are 
characterized by well-defined human inten/ention such as selecting the lines to be crossed, 
directing pollination of the parental lines, or selecting appropriate progeny plants. 
Depending on the desired properties different breeding measures are taken. The relevant 
techniques are well known in the art and include but are not limited to hybridization, 
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inbreeding, backcross breeding, multiline breeding, variety blend, interspecific hybridization, 
aneuploid techniques, etc. Hybridization techniques also include the sterilization of plants to 
yield male or female sterile plants by mechanical, chemical or biochemical means. Cross 
pollination of a male sterile plant with pollen of a different line assures that the genome of 
the male sterile but female fertile plant will uniformly obtain properties of both parental lines. 
Thus, the transgenic seeds and plants according to the invention can be used for the 
breeding of improved plant lines that for example increase the effectiveness of conventional 
methods such as herbicide or pesticide treatment or allow to dispense with the methods due 
to their modified genetic properties. Alternatively new crops with improved stress tolerance 
can be obtained that, due to their optimized genetic "equipment", yield harvested product of 
better quality than products that were not able to tolerate comparable adverse 
developmental conditions. 

In seeds production germination quality and uniformity of seeds are essential 
product characteristics, whereas gemiination quality and uniformity of seeds harvested and 
sold by the famner is not important. As it is difficult to keep a crop free from other crop and 
weed seeds, to control seedbome diseases, and to produce seed with good germination, 
fairly extensive and well-defined seed production practices have been developed by seed 
producers, who are experienced in the art of growing, conditioning and marketing of pure 
seed. Thus, it is common practice for the farmer to buy certified seed meeting specific 
quality standards instead of using seed harvested from his own crop. Propagation material 
to be used as seeds is customarily treated with a protectant coating comprising herbicides, 
insecticides, fungicides, bactericides, nematicides, molluscicides or mixtures thereof. 
Customarily used protectant coatings comprise compounds such as captan. carboxin, 
thiram (TMTD®), methalaxyl (Apron®), and pirimiphos-methyl (Actellic®). If desired these 
compounds are formulated together with further carriers, surfactants or application- 
promoting adjuvants customarily employed in the art of formulation to provide protection 
against damage caused by bacterial, fungal or animal pests. The protectant coatings may 
be applied by impregnating propagation material with a liquid formulation or by coating with 
a combined wet or dry fomnulation. Other methods of application are also possible such as 
treatment directed at the buds or the fmit. 

It is thus a further object of the present invention to provide plant propagation 
material for cultivated plants, but especially plant seed that is treated with an seed 
protectant coating customarily used in seed treatment. 

It is a further aspect of the present invention to provide new agricultural methods 
such as the methods exemplified above, which are characterized by the use of transgenic 
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plants, transgenic plant material, or transgenic seed according to the present invention. 
Comprised by the present invention is an agricultural method, wherein a transgenic plant or 
the progeny thereof is used comprising a chimeric gene according to the invention in an 
amount sufficient to express herbicide resistant forms of herbicide target proteins in a plant 
to confer tolerance to the herbicide. 

To breed progeny from plants transformed according to the method of the present 
invention, a method such as that which follows may be used: maize plants produced as 
described in the examples set forth below are grown in pots in a greenhouse or in soil, as is 
known in the art, and permitted to flower. Pollen is obtained from the mature tassel and 
used to pollinate the ears of the same plant, sibling plants, or any desirable maize plant. 
Similarly, the ear developing on the transformed plant may be pollinated by pollen obtained 
from the same plant, sibling plants, or any desirable maize plant. Transfonned progeny 
obtained by this method may be distinguished from non-transformed progeny by the 
presence of the introduced gene(s) and/or accompanying DNA (genotype), or the 
phenotype conferred. The transformed progeny may similarly be selfed or crossed to other 
plants, as is normally done with any plant carrying a desirable trait. Similarly, tobacco or 
other transformed plants produced by this method may be selfed or crossed as is known in 
the art in order to produce progeny with desired characteristics. Similarly, other transgenic 
organisms produced by a combination of the methods known in the art and this invention 
may be bred as. is known in the art in order to produce progeny with desired characteristics. 

The modified inhibitor-resistant protox enzymes of the invention have at least one 
amino acid substitution, addition or deletion relative to their naturally occurring counterpart 
v (i:e. inhibitor-sensitive fonms that occur naturally: in a plant without being manipulated, either 
directly via recombinant DNA methodology or indirectly via selective breeding, etc., by man). 
• Amino acid positions that may be modified to yield an inhibitor-resistant form of the protox 
enzyme, or enhance inhibitor resistance, are indicated in bold type in Table 1 A in the 
context of plant protox-1 sequences from Arabidopsis, maize, soybean, cotton, sugar beet, 
oilseed rape, rice, sorghum, wheat, and sugar cane. The skilled artisan will appreciate that 
equivalent changes may be made to any plant protox gene having a structure sufficiently 
similar to the protox enzyme sequences shown herein to allow alignment and identification 
of those amino acids that are modified according to the invention to generate inhibitor- 
resistant forms of the enzyme. Such additional plant protox genes may be obtained using 
standard techniques as described in Intemational application no. PCT/IB95/00452 filed 
June 8, 1995, published Dec. 21, 1995 as WO 95/34659 whose relevant parts are herein 
incorporated by reference. 
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DNA molecules encoding the herbicide resistant protox coding sequences taught 
herein may be genetically engineered for optimal expression In a crop plant. This may 
include altering the coding sequence of the resistance allele for optimal expression In the 
crop species of interest. Methods for modifying coding sequences to achieve optimal 
expression in a particular crop species are well known (see, e.g. Perlak etal., Proa Natl. 
Acad Sci. USA 88:3324 (1991); Koziel ef a/., Bio/technol. 77; 194 (1993)). 

Genetically engineering a protox coding sequence for optimal expression may also 
include operatively linking the appropriate regulatory sequences (i.e. promoter, signal 
sequence, transcriptional terminators). Examples of promoters capable of functioning in 
plants or plant cells (i.e., those capable of driving expression of the associated structural 
genes such as protox in plant cells) include the cauliflower mosaic virus (CaMV) 19S or 35S 
promoters and CaMV double promoters; nopaline synthase promoters; 
pathogenesis-related (PR) protein promoters; small subunit of ribulose bisphosphate 
carboxylase (ssuRUBISCO) promoters, heat shock protein promoter from Brassica with 
reference to EPA 0 559 603 (hspSO promoter), Arabidopsis actin promoter and the 
SuperMas promoter with reference to WO 95/14098 and the like. Preferred promoters will 
be those that confer high level constitutive expression or, more preferably, those that confer 
specific high level expression in the tissues susceptible to damage by the herbicide. 
Preferred promoters are the rice actin promoter (McElroy etal., Mol. Gen. Genet. 231: 150 
(1991)), maize ubiquitin promoter (EP 0 342 926; Taylor etal., Plant Cell Rep. 12: 491 
(1993)), and the PR-1 promoter from tobacco, Arabidopsis, or maize (see U.S. Patent No. 
5,614,395 to Ryals ef a/., incorporated by reference herein in its entirety). The promoters 
themselves may be modified to manipulate promoter strength to increase protox 
expression, in accordance with art-recognized procedures. 

The inventors have also discovered that another preferred promoter for use with the 
inhibitor-resistant protox coding sequences is the promoter associated with the native 
protox gene (i.e. the protox promoter; see copending, co-owned U.S. Patent Application No. 
08/808,323, entitled "Promoters from Protoporphyrinogen Oxidase Genes", incorporated by 
reference herein in its entirety.) The promoter sequence from an /4rad/dops/s protox-1 gene 
is set forth in SEQ ID N0:13, the promoter sequence from a maize protox-1 gene is set 
forth in SEQ ID N0:14, and the promoter sequence from a sugar beet protox-1 gene is set 
forth in SEQ ID N0:26. 

Since the protox promoter itself is suitable for expression of inhibitor-resistant protox 
coding sequences, the modifications taught herein may be made directly on the native 
protox gene present in the plant cell genome without the need to construct a chimeric gene 
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with heterologous regulatory sequences. Such modifications can be made via directed 
mutagenesis techniques such as homologous recombination and selected for based on the 
resulting herbicide-resistance phenotype (see, e.g. Example 10, Pazkowski etaL, EMBOJ. 
7:4021-4026 (1988), and U.S. Patent No. 5,487,992, particularly columns 18-19 and 
Example 8). An added advantage of this approach is that besides containing the native 
protox promoter, the resulting modified gene will also include any other regulatory elements, 
such as signal or transit peptide coding sequences, which are part of the native gene. 

In the event of transformation of the nuclear genome, signal or transit peptides may 
be fused to the protox coding sequence in chimeric DNA constructs of the invention to 
direct transport of the expressed protox enzyme to the desired site of action. Examples of 
signal peptides include those natively linked to the plant pathogenesis-related proteins, e.g. 
PR-1, PR-2, and the like. See, e.g., Payne ef a/.. Plant MoL Biol 7 7:89-94 (1988). 
Examples of transit peptides include the chloroplast transit peptides such as those 
described in Von Heijne ef a/., Plant MoL BtOlrRep: 9:104-126 (1991); Mazur etal.. Plant 
PhysioL 85: 1 1 1 0 (1 987); Vorst et aL, Gene 65: 59 (1 988), and mitochondrial transit 
peptides such as those described in Boutry ef a/., Nature 328.-340-342 (1987). Chloroplast 
and mitochondrial transit peptides are contenriplated to be particulariy useful with the 
present invention as protox enzymatic activity typically occurs within the mitochondria and 
chloroplast. Most prefen^ed for use are chloroplast transit peptides, as inhibition of the 
protox enzymatic activity in the chloroplasts is contemplated to be the primary basis for the 
action of protox-inhibiting heriDicides (Witkowski and Hailing. P/anf PhysioL 87: 632 (1988); 
' Lehnen ef a/.> Pestic, Blochem. PhysioL 37: 239 (1990); Duke et aL, WeedScL 3ft 465 
:: {1991));' Also included are sequences that result in localization of the encoded protein to 
various cellular compartments such as the vacuole. See, for example, Neuhaus et aL, Proc, 
NatL Acad ScL USA 88: 10362-10366 (1991) and Chrispeels, Ann, Rev, Plant PhysioL 
Plant MoL BioL 42: 21-53 (1991). The relevant disclosures of these publications are 
incorporated herein by reference in their entirety. 

Chimeric genes of the invention may contain multiple copies of a promoter or 
multiple copies of the protox structural genes. In addition, the construct(s) may include 
coding sequences for mari<ers and coding sequences for other peptides such as signal or 
transit peptides, each in proper reading frame with the other functional elements in the DNA 
molecule. The preparation of such constmcts are within the ordinary level of skill in the art. 

Useful markers include peptides providing herbicide, antibiotic or drug resistance, 
such as, for example, resistance to hygromycin, kanamycin, G418, gentamycin, lincomycin, 
methotrexate, glyphosate, phosphinothricin, or the like. These mari<ers can be used to 
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select cells transformed with the chimeric DNA constructs of the invention from 
untransformed cells. Other useful markers are peptidic enzymes that can be easily 
detected by a visible reaction, for example a color reaction, for example luciferase, 
B-glucuronidase, or B-galactosidase. 

The method of positive selection of genetically transformed cells into which a 
desired nucleotide sequence can be incorporated by providing the transfonned cells with a 
selective advantage is herein incorporated by reference as WO 94/20627. 

Where a herbicide resistant protox allele is obtained via directed mutation of the 
native gene in a crop plant or plant cell culture from which a crop plant can be regenerated, 
it may be moved into commercial varieties using traditional breeding techniques to develop 
a herbicide tolerant crop without the need for genetically engineering the modified coding 
sequence and transforming it into the plant. Alternatively, the herbicide resistant gene may 
be isolated, genetically engineered for optimal expression and then transformed into the 
desired variety. 

Genes encoding altered protox resistant to a protox inhibitor can also be used as 
selectable markers in plant cell transformation methods. For example, plants, plant tissue 
or plant cells transfonned with a transgene can also be transfonned with a gene encoding 
an altered protox capable of being expressed by the plant. The thus-transformed cells are 
transferred to medium containing the protox inhibitor wherein only the transformed cells will 
sun/ive. Protox inhibitors contemplated to be particularly useful as selective agents are the 
diphenylethers {e.g. acifluorfen, 5-[2-chloro-4-(trifluoromethyl)phenoxy]-2-nitrobezoic acid; 
its methyl ester; or oxyfluorfen. 2-chloro-1-(3-ethoxy-4-nitrophenoxy)-4-(trifluoroben2ene)}, 
oxidiazoles, (e.g. oxidiazon, 3-[2,4-dichloro-5-(1-methylethoxy)phenyl]-5-(1,1-dimethylethyl)- 
1,3,4-oxadiazol-2-(3W)-one), cyclic imides (e.g. S-23142, /V-(4-chloro-2-fluoro-5- 
propargyloxyphenyl)-3,4,5.6-tetrahydrophthalimide; chlorophthalim, A/-(4-chlorophenyl)- 
3,4,5,6-tetrahydrophthalimide). phenyl pyrazoles (e.g. TNPP-ethyl, ethyl 2-[1 -(2,3,4- 
trichlorophenyl)-4-nitropyrazoIyl"5-oxy]propionate; M&B 39279), pyridine derivatives, such 
as e.g. pyridyl amides (e.g. LS 82-556), and phenopylate and its Ophenylpyrrolidino- and 
piperidinocarbamate analogs and triazolones, such as bicyclic triazolones as disclosed in 
the International patent application WO 92/04827; EP 532146). 

The method is applicable to any plant cell capable of being transformed with an 
altered protox-encoding gene, and can be used with any transgene of interest. Expression 
of the transgene and the protox gene can be driven by the same promoter functional on 
plant cells, or by separate promoters. 
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Modified Inhibitor-resistant protox enzymes of the present invention are resistant to 
herbicides that inhibit the naturally occurring protox activity. The herbicides that inhibit 
protox include many different structural classes of molecules (Duke et aL, Weed ScL 39: 
465 (1991); Nandihalli etaL, Pesticide Biocfiem. Physiol. 43: 193 (1992); Matringe ef a/., 
FEBS Lett. 245: 35 (1 989); Yanase and Andoh, Pesticide Biochem. Physiol. 35: 70 (1 989)), 
including the diphenylethers {e.g. acifluorlfen, 5-[2-chloro-4-(trifluoromethyl)phenoxy]-2- 
nitrobezoic acid; its methyl ester; or oxyfluorfen, 2-chloro-1 -(3-ethoxy-4-nitrophenoxy)-4" 
(trifluorobenzene)}, oxidiazoles (e.g. oxidiazon, 3-[2,4-dichloro-5-(1-methylethoxy)phenyl]-5- 
(1,1"dimethylethyl)-1,3,4-oxadiazol-2-(3H)-one), cyclic imides (e.g. S-23142, Af-(4-chloro-2- 
fluoro-5-propargyloxyphenyl)-3,4,5,6-tetrahydrophthalimide; chlorophthalim, A^(4- 
chlorophenyl)-3,4,5.6-tetrahydrophthalimide). phenyl pyrazoles (e.g. TNPP-ethyl, ethyl 2-[1- 
(2,3,4-trichlorophenyl)-4-nitropyra2olyl-5-oxy]propionate; M&B 39279), pyridine derivatives 
(e.g. LS 82-556), and phenopylate and its 0-phenylpyrrolidino- and piperidinocarbamate 
analogs. 

The diphenylethers of particular significance are those having the general formula 

NO2 

(Formula I) 

wherein R equals -COONa (Formula II), -CONHSO2CH3 (Fonmula III) or -COOCH2COOC2H5 
(Formula IV; see Maigrot etaL, Brighton Crop Protection Conference-Weeds: 47'5^ (1989)). 
Additional diphenylethers of interest are. those where R equals: 

NOGH2COOCH3 
CGH2OCH3 

(Formula IVa; see Hayashi et aL, Brighton Crop Protection Conference-Weeds: 53-58 
(1989)). 

An additional diphenylether of interest is one having the formula: 

COOCH3 





(Formula IVb; bifenox, see Dest etaL, Proc. Northeast Weed ScL Conf. 27:31 (1973)). 
A further diphenylether of interest is one having the formula: 
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OCH2CH3 




CI 

(Formula IVc; oxyfiuorfen; see Yih and Swithenbank, J. Agric. Food Chem., 23: 592 (1975)) 
Yet another diphenylether of interest is one having the formula: 

CO2CHCO2CH2CH3 




Cl 



(Formula IVd; lactofen, see page 623 of 'The Pesticide Manual", 10*^ ed., ed. by C. Tomlin. 
British Crop Protection Council, Surrey (1 994)) 

Also of significance are the class of herbicides known as imides, having the general 
formula 




(Formula V) 



o (Formula VI) 

or 

o 




N- (Fomriula VII) 

or 
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CR 



(Formula VIII) 



or 



HFoC 



or 



r ;n- 



(Formula IX) 




(Formula IXa) 



or 




OCHF2 



(Formula IXb) 

(see Hemper et aL (1995) in "Proceedings of the Eighth International Congress of Pesticide 
Chemistry", Ragdale etaL, eds., Amer. Chem. Soc, Washington, D.C., pp.42-48 (1994)); 
and -Rv equals H, GI -or F. Ri equals CI and R3 is an optimally substituted ether, thioether, 
ester^ amino or alkyi group. Alternatively, R2 and R3 together may form a 5 or 6 membered 
heterocyclic ring. Examples of imide herbicides of particular interest are 




SCH2CO2CH3 



(Formula Vila; fluthiacet-methyl, see Miyazawa etaL, Brighton Crop Protection Conference- 
Weeds, pp. 23-28 (1993)) 
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.o\ o 

CH3SO2NH (Formula X sulfentrazone, see Van Saun et ai, 

Brighton Crop Protection Conference-Weeds, pp. 77-82 (1991)). 



lL^N.^^;v^COOCH(Chi)2 



(Formula XI) 



CI- 



H5C2OOCCH2O' 




CI 



J 



N 




N 
I 

CH3 



OCHF2 



(Formula Xil) 
(see Miura ef a/., Brighton Crop Protection 
Conference-Weeds: 35-40 (1 993)) 



scHjCOOCht (Formula XIII) 




P F 



N- 




CI 



OCHgCOOC^H,! (Formula XIV) 




O F 



O 



0-CHCsCH 



(Formula XV) 
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O 



F 




O 



(Formula XVI) 



The herbicidal activity of the above compounds is described in the Proceedings of 
the 1991 Brighton Crop Protection Conference, Weeds (British Crop Protection Council) 
(Formulae X and XVI), Proceedings of the 1993 Brighton Crop Protection Conference, 
Weeds (British Crop Protection Council) (Formulae XII and XIII), U.S. Patent No. 4,746,352 
(Formula XI) and Abstracts of the Weed Science Society of America vol. 33, pg. 9 
(1993)(FormulaXIV). 

The most preferred imide herbicides are those classified as aryluracils and having 
the general formula 



COOR (Fonnula XVII) 
wherein R signifies the group (C2.6-alkenyloxy)cartDonyI-Ci-4-alkyl, as disclosed in U.S. 
Patent No. 5,183,492, herein incorporated by reference. 

.Also of significance are herbicides having the general formula: 

^Ol ^F 





(Fomriula XVIII; thiadiazimin) 
(see Weiler et ai, Brighton Crop Protection Conference- 
Weeds, pp. 29-34 (1 993)); 
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CH3CH2O 




N-CHF2 



CH 



(Formula XIX; carfentrazone) 
(see Van Saun ef a/., Brighton Crop Protection Conference- 
Weeds: pp. 1 9-22 (1 993)); 
N-substituted pyrazoles of the general formula: 




(Fomnula XX) 

wherein Ri is CrC4-alkyI, optionally substituted by one or more halogen atoms; 

R2 is hydrogen, or a Ci-C4-alkoxy, each of which is optionally substituted by one or 
more halogen atoms, or 

Ri and R2 together from the group -(CH2)n-X-, where X is bound at R2; 
R3 is hydrogen or halogen, 
R4 is hydrogen or Ci-C4-alkyl, 

R5 is hydrogen, nitro, cyano or the group -COORe or -CONRyRg, and 

Re is hydrogen, CrCe-alkyI, Ca-Ce-alkenyl or Ca-Cs-alkynyl; 

(see international patent publications WO 94/08999, WO 93/10100, and U. S. 
Patent No. 5,405,829 assigned to Schering); 
N-phenylpyrazoles, such as: 
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CF3 

(Formula XXI; nipyraclofen) 
{see page 621 of 'The Pesticide Manual", 9th ed., ed. by C.R. Worthing, 

British 

Crop Protection Council, Surrey (1991)); 
and 3-substituted-2-aryl-4,5,6,7-tetrahydroinda2oles (Lyga et aL Pesticide Sci, 42:29-36 
(1994)). 




CHF2 

(Formula XXIa; BAY 1 1 340) 

Also of significance are phenylpyrazoles of the type described in WO 96/01254 and 
WO 97/00246, both of which are hereby incorporated by reference. (Fonnula XXII), 

Also of significance are pyridyl pyrazoles such as the following: 
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CGA 374410 

NOA 401954 

(Formula XXllla) (Formula XXIIIb) 



Also of significance are phenylpyrazoles having the general formula: 

o 

II 



C— OPr-i 




F3C Br 



(Formula XXIV: Fluazolate) 
(seeProsch, S. D. eta I., Brighton Crop Protection Conference- 
Weeds: pp. 45-50 (1991), (Vol. 1)) 

Additional compounds of significance are described in W098/33927 and US patent 
5,856,495, both of wliicli are incorporated herein by reference in their entirety. 

Additional compounds of significance are also, for example, Chlornitrofen (4- 
nitrophenyi 2,4,6-trichlorophenyiether), Ethoxyfen (ethyl 0-[2-chloro-5-(2-chloro-a,a,a,- 

trifluoro-p-tolyioxy)benzoyl]-L-lactate), Formesafen(5-(2-chloro-a,a,a-trifIuoro-p-tolyloxy)-N- 
methyl-sulfonyl-2-nitro-benzamide), Azafenidin (2-(2,4-dichloro-5-prop-2-ynyloxyphenyl)- 
5,6,7,8-tetrahydro-1 ,2,4-triazolo[4,3-a]pyridin-3-(2H)-one), Butafenacil or Fluobutracil (2-[2- 

chloro-5-(3,6-dihydro-2,6-dioxo-3-methyl-4-trifluoromethyl-1-(2H)-pyrimidinyl)-benzoyloxy]-2- 
methyl-propionic acid allyl ester), Cinidon-ethyl (2-chloro-3-[2-chloro-5-(cyclohex-1-en-1 ,2- 
dicarboximido)phenyl]acrylic acid ethyl ester), Fluthiacet-methyl (methyl[2-chloro-4-fluoro-5- 
(5,6,7,8-tetrahydro-3-oxo-1 H,3H-[1 ,3,4]thiadiazolo[3,4 a] pyridazin-1 -ylideneamino) 

phenylthio]acetate), Oxadiargyl(5-tert.-butyi-3-[2,4-dichloro-5-(prop-2-ynyloxy)phenyl]- 
1 ,3,4-oxadiazol-2(3H)-one), Oxadiazon (5-tert.-buty[-3-(2,4-dichloro-5-isopropoxyphenyl)- 
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1 ,3,4-oxa-diazol-2(3H)-one), Pentoxazone (3-{4-chloro-5-cyclopentyloxy-2-fluorophenyl)-5- 
isopropylidene-1 ,3-oxazolidine-2,4-dione), Pyraflufen-ethyl (ethyl 2 chIoro-5-(4-chloro-5- 
difluoromethoxy-1-methyl-pyra2ol-3-yl)-4-fluorophenoxy acetate), Flumipropyn ((+)-N-[4- 

chloro-2-fluoro-5-{1 -methylprop-2-ynyl)oxy phenyl]cycIohex-1 -ene-1 ,2-dicarboximide), 
Flupropacil (2-chloro-5-(1 ,2,3,6-tetrahydro-3-methyl-2,6<iioxo-4-trif luoromethyl-pyrimidin-1 - 
yl)benzoic acid isopropyl ester), Thidiazimin ((Z)-6-(6,7-dihydro-6,6-dimethyl-3H,5H-pyrrolo-[ 
2J-c]thiadiazol-3-ylideneamino)-74luorG-4-(2-propynyl)-2H-1,4-benzoxazin-3-(4H)-one), 
Benzfendizone (methyl 2'\ 5-ethyl-2-[4-(1 ,2,3.6-tetrahydrO"3-methyl-2,6-<lioxo-4- 
trifluoromethyl-pyrimidin-1-yl)phenoxymethyl]phenoxy ^propionate), Pyrazogyl or Pyraclonil 
(1 -(3-chloro-4,5,6,7-tetrahydropyrazolo[1 ,5-a]pyridin-2-yl)-5-(methyl-2"propynylamino)-l H- 
pyrazole-4-carbonitirile), Profluazole (1-chloro-N-[2-chloro-4-fiuoro-5-[(6S, 7aR)-6- 

fluorotetrahydro-1 ,3-dioxo-1 H-pyrrolo[1 ,2-c]imidazol-2(3H)-yl]phenyl] methanesulfonamide). 

An overview of such compounds is also found in Teroxidizing Herbicides", ed. by 
Springer, Berlin, eds. P. Soger & K. Wakabayashi, 1999. 

Levels of herbicide that normally are inhibitory to the activity of protox include 
application rates.known in the art, and that depend partly on external f actors such as 
environment, time and -method of application. For example, in the case of the imide 
herbicides represented by Formulae V through IX, and more particularly those represented 
by Formulae X through XVII, the application rates range from 0.0001 to 10 kg/ha, preferably 
from'0:005 to 2-kg/ha. This dosage rate or concentration of herbicide may be different, 
depending on the desired action and particular compound used, and can be determined by 
-^methods known^invthe art. 

^ A further object of , the invention is a method for controlling the growth of undesired 
vegetation that comprises applying to a population of the plant selected from a group 
consisting of Arabidopsis, sugar cane, soybean, barley, cotton, tobacco, sugar beet, oilseed 
rape, maize, wheat, sorghum, rye, oats, turf and forage grasses , millet, forage and rice and 
the like an effective amount of a protox-inhibiting herbicide. Preferred is a method for 
controlling the growth of undesired vegetation, which comprises applying to a population of 
the selected from the group consisting of selected from the group consisting of soybean, 
cotton, tobacco, sugar beet, oilseed rape, maize, wheat, sorghum, rye, oats, turf grasses 
and rice an effective amount of a protox-inhibiting heriDicide, Particularly preferred Is a 
method for controlling the growth of undesired vegetation, which comprises applying to a 
population of the selected from the group consisting of Arabidopsis, soybean, cotton, sugar 
beet, oilseed rape, maize, wheat, sorghum, and rice. 
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III. Plastid Transformation and Expression 

The present invention further encompasses a chimeric gene comprising a promoter 
capable of expression in a plant plastid operatively linked to a DNA molecule of the present 
invention. A preferred promoter capable of expression In a plant plastid is a promoter 
isolated from the 5' flanking region upstream of the coding region of a plastid gene, which 
may come from the same or a different species, and the native product of which is typically 
found in a majority of plastid types including those present in non-green tissues. Examples 
of such promoters are promoters of c/pP genes, such as the tobacco c/pP gene promoter 
(WO 97/06250, incorporated herein by reference) and the Arabidopsis c/pP gene promoter 
(U.S. Application No. 09/038,878, incorporated herein by reference). Other promoters that 
are capable of expressing a DNA molecule in plant plastlds are promoters recognized by 
viral RNA polymerases. Preferred promoters of this type are promoters recognized by a 
single sub-unit RNA polymerase, such as the T7 gene 10 promoter, which is recognized by 
the bacteriophage T7 DNA-dependent RNA polymerase. Yet another promoter that is 
capable of expressing a DNA molecule in plant plastids comes from the regulatory region of 
the plastid 16S ribosomal RNA operon (Harris ef a/., MicrobioL Rev, 58:700-754 (1994), 
Shinozaki etaL, EMBO J, 5:2043-2049 (1986), both of which are incorporated herein by 
reference). The gene encoding the T7 polymerase is preferably transfomied into the 
nuclear genome and the T7 polymerase is targeted to the plastids using a plastid transit 
peptide. Expression of the DNA molecules in the plastids can be constitutive or can be 
inducible. These different embodiment are extensively described In WO 98/1 1235, 
incorporated herein by reference. The chimeric gene preferably further comprises a 5' 
untranslated sequence (5* UTR) functional in plant plastids and a plastid gene 3' 
untranslated sequence (3* UTR) operatively linked to a DNA molecule of the present 
invention. Preferably, the 3' UTR is a plastid ^s16 gene 3' untranslated sequence. In a 
further embodiment, the chimeric gene comprises a poly-G tract instead of a 3' untranslated 
sequence. 

The present invention also encompasses a plastid transfonmation vector comprising 
the chimeric gene described above and flanking regions for integration into the plastid 
genome by homologous recombination. The plastid transformation vector may optionally 
comprise at least one chloroplast origin of replication. The present invention also 
encompasses a plant plastid transformed with such a plastid transformation vector, wherein 
the DNA molecule is expressible in the plant plastid. The invention also encompasses a 
plant or plant cell, including the progeny thereof, comprising this plant plastid. In a prefen-ed 
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embodiment, the plant is homoplasmic for transgenic plastids. The plants transformed in the 
present invention may be monocots or dicots. A preferred monocot is maize and a 
preferred dicot is tobacco. Other preferred dicots are tomato and potato. 

In a preferred embodiment, the present invention encompasses a chimeric gene 
comprising a promoter capable of expression in a plant plastid operatively linked to a DNA 
molecule isolated from a prokaryote or a eukaryote that encodes a native or modified protox 
enzyme, such as a DNA molecule that encodes a native or modified wheat, soybean, 
cotton, sugar beet, oilseed rape, rice, sorghum, or sugar cane protox enzyme. Such a DNA 
molecule is comprised in a plastid transformation vector as described above and plants 
homoplasmic for transgenic plastid genomes are produced. Expression in plant plastids of a 
DNA molecule that encodes a modified protox enzyme preferably confers upon the plant 
tolerance to a herbicide in amounts that inhibit naturally occurring protox activity. 

In a further preferred embodiment, the present invention encompasses a chimeric 
gene comprising (a) a DNA molecule isolated from a plant, which in its native state encodes 
a polypeptide that comprises a plastid transit peptide, and a mature enzyme that is natively 
targeted to a plastid of the plant by the plastid transit peptide, wherein the DNA molecule is 
modified^such that it does not encode a functional plastid transit peptide; and (b) a promoter 
capable of expressing the DNA molecule in a plastid, wherein the promoter is operatively 

-linked to the DNA molecule. In one preferred embodiment, the transit peptide is mutated 
and4hus does not allow the proper transport of the enzyme encoded by the DNA molecule 
to the desired cell compartment, such as the plastid. In another preferred embodiment, a 
pbrtiomof the transit peptide coding sequence or the entire transit peptide coding sequence 

.^is*removed from the DNA molecule, pr^venting the enzyme from being properly targeted to 

r.the^desired cell compartment. 

The chimeric genes described above are inserted in plastid transformation vectors, 
and the present invention is therefore also directed to plants having their plastid genome 
transfomned with such vectors, whereby the DNA molecule is expressible in plant plastids. 
Such plants are preferably homoplasmic for transgenic plastids. 

In a preferred embodiment, a DNA molecule described immediately above encodes 
an enzyme that in its wild-type fomi is inhibited by a herbicide. In a further preferred 
embodiment, the DNA molecule encodes an enzyme that in its wild-type form is inhibited by 
a herbicide, but that comprises at least one amino acid change compared to the wild-type 
enzyme. Such an amino acid change makes the enzyme resistant to compounds that 
naturally inhibit the wild-type enzyme. In a further preferred embodiment, the DNA molecule 
encodes an enzyme having protoporphyrinogen oxidase (protox) activity. In a further 
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preferred embodiment, the transit peptide is removed from the DNA molecule as further 
illustrated in Examples 37-42. Plants homoplasmic for transgenic plastids of the invention 
are resistant to high amounts of herbicides such as Formula XVII that inhibit the naturally 
occurring protox activity (as further illustrated in Example 44). 

In another preferred embodiment, the transit peptide of a DNA molecule encoding a 
5-enolpyruvyl-3-phosphoshikimate synthase (EPSP synthase) is mutated or removed. The 
resulting DNA molecule is fused to a promoter capable of expression in plant plastids and 
homoplasmic plants harboring such constructs in their plastid genomes are obtained. These 
plants are resistant to herbicidal compounds that naturally inhibit EPSP synthase, in 
particular glyphosate. In another preferred embodiment, the transit peptide of a DNA 
molecule encoding a acetolactate synthase (ALS> is mutated or removed. The resulting 
DNA molecule is fused to a promoter capable of expression in plant plastids and 
homoplasmic plants harboring such constructs in their plastid genome are obtained. These 
plants are resistant to herbicidal compounds that naturally inhibit ALS. in particular 
sulfonylureas. In another preferred embodiment, the transit peptide of a DNA molecule 
encoding a acetoxyhydroxyacid synthase (AHAS) is mutated or removed. The resulting 
DNA molecule is fused to a promoter capable of expression in plant plastids and 
homoplasmic plants harboring such constructs in their plastid genome are obtained. These 
plants are resistant to herbicidal compounds that naturally Inhibit AHAS, in particular, 
imidazolinone and sulfonamide herbicides. In another preferred embodiment, the transit 
peptide of a DNA molecule encoding an acetylcoenzyme A carboxylase (ACCase) is 
mutated or removed. The resulting DNA molecule is fused to a promoter capable of 
expression in plant plastids and homoplasmic plants harboring such constructs in their 
plastid genome are obtained. These plants are resistant to herbicidal compounds that 
naturally inhibit ACCase, in particular cyclohexanedione and arylphenoxypropanoic acid 
herbicides. In another preferred embodiment, the transit peptide of a DNA molecule 
encoding a glutamine synthase (GS) is mutated or removed. The resulting DNA molecule is 
fused to a promoter capable of expression in plant plastids and homoplasmic plants 
harboring such constructs in their plastid genome are obtained. These plants are resistant 
to herbicidal compounds that naturally inhibit GS, in particular phosphinothricin and 
methionine sulfoximine. 

The present invention is also further directed to a method of obtaining herbicide- 
resistant plants by transfonning their plastid genome with a chimeric gene comprising (a) a 
DNA molecule isolated from a plant, which in its native state encodes a polypeptide that 
comprises a plastid transit peptide, and a mature enzyme that is natively targeted to a 
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plastid of the plant by the plastid transit peptide, wherein the DNA molecule is modified 
such that it does not encode a functional plastid transit peptide; and (b) a promoter capable 
of expressing the DNA molecule in a plastid, wherein the promoter is operatively linked to 
the DNA molecule. Examples of enzymes that are used in the present invention are cited 
immediately above, but the applicability of such a method is not limited to the cited 
examples. 

The present invention is still further directed to a novel method for selecting a 
transplastomic plant cell, comprising the steps of: introducing the above-described chimeric 
gene into the plastome of a plant cell; expressing the encoded enzyme in the plastids of the 
plant cell; and selecting a cell that is resistant to a herbicidal compound that naturally 
inhibits the activity of the enzyme, whereby the resistant cell comprises transformed 
plastids. In a preferred embodiment, the enzyme is naturally inhibited by a herbicidal 
compound and the transgenic plant is able to grow on an amount of the herbicidal 
compound that naturally inhibits the activity of the enzyme. In a further preferred 
embodiment, the enzyme has protoporphyrinogen oxidase (protox) activity and is modified 
so that it that confers resistance to protox inhibitors. 

A further aspect^of the. present invention -is a novel method for plastid transformation 
of ^recalcitrant plants. The methods pioneered for plastid transformation of tobacco and 
lower plant species rely on non-lethal selection for resistance to antibiotics that 

.^preferentially affect the plastid translational apparatus and hence allow photo-heterotrophic 

' transformants to outgrow heterotrophic, non-transfomied tissue. 

Several factors have likely contributed to the difficulties encountered with plastid 

i trahsfomnation of monocptSvand^other^dicots. rFor example, the maize chloroplast 1 6S 

^ ribpsomal RNA (rRNA) is naturally resistant to spectinomycin because of the presence of a 
G at position 1138 in the Zea mays 16S rDNA gene (Harris ef a/., 1994). Thus, utilization of 
16s rRNA point mutations that confer spectinomycin and/or streptomycin resistance which 
have been used successfully as selectable chloroplast markers in Chlamydomonas and 
tobacco (Boynton and Gillham (1993) /n Wu. R. [Ed.] Methods in EnzymologyVo\ 217. 
Academic Press, San Diego, pp. 510-536; Svab etah (1990) Proc. Natl. Acad. Sci. U.S.A. 
87: 8526-8530) is not feasible for maize. Natural spectinomycin and streptomycin 
resistance in maize also obviates the use of the bacterial aadA gene encoding 
aminoglycoside 3"-adenyltransferase, which results in dominant spectinomycin and 
streptomycin resistance and allows a 100-fold increase in tobacco chloroplast 
transformation efficiency (Svab and Maliga (1993) Proc. Natl Acad. ScL U.SA. 90: 913- 
917). Use of kanamycin (the only other antibiotic proven to be useful for chloroplast 
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transformation) is also problematic due to a large excess (ca. 50:1) of nuclear vs. 
chloroplast-encoded resistance in tobacco following bombardment of the bacterial nptll 
gene encoding neomycin phosphotransferase (Carrer etal. (1993) MoL Gen. Genet 241: 
49-56). This has been shown to result from both a high frequency of spontaneous nuclear 
resistance mutants as well as integration of nptll into the nuclear genome. Since nptll is 
also a highly effective selectable marker for maize nuclear transformation it is reasonable to 
expect similar background levels to that observed in tobacco. Spontaneous resistance and 
a significant excess of selectable marker integration by random, illegitimate recombination 
into the nuclear genome, rather than homologous integration into the chloroptast genome, 
would make recovery of bona fide chloroplast transformants difficult if not impossible. 

A more fundamental reason for the difficulties encountered with plastid 
transfomnation in plant species other than tobacco may have to do with the non- 
photosynthetic nature of many regenerable cultured plant tissues, especially in maize and 
Arabidopsis, Tobacco is an exception in that cultured vegetative tissues are regenerable 
and contain mature differentiated chloroplasts that are photosynthetically competent in the 
presence of sucrose. Consequently, the current system for selecting tobacco plastid 
transformants relies on the faster growth rate of transformed cells that can use both 
reduced and inorganic carbon sources. Moreover, transformed cells do not suffer the 
chloroplast membrane damage that results from inhibition of plastid protein synthesis in the 
light. This expression of selectable markers that act preferentially on photosynthetic cells, 
driven by promoters that have high activity in differentiated chloroplasts, is unlikely to work 
in non-green tissues containing proplastids (e.g. dark-grown maize Type I callus, somatic 
embryos) or amyloplasts/leucoplasts (e.g. Arabldopsis root cultures). Plastid transformation 
in these plants requires a selectable marker that gives strong selection in all plastid types. 

A preferred selectable marker for generalized plastid transformation: (1) is active 
only in the plastid to eliminate nuclear-transformed "escapes"; (2) has a mode of action that 
does not depend on photosynthetic competence or the presence of fully differentiated 
chloroplasts; and (3) has a level of resistance that is co-dependent on an adjustable 
external parameter (e.g. light), rather than being determined solely by the bulk 
concentration of a selective agent, so that selection pressure can vary during selection to 
facilitate segregation of the many-thousand plastid genome copies. 

In a preferred embodiment, such a selectable marker gene involves the use of a 
chimeric gene comprising an isolated DNA molecule encoding a plastid-targeted enzyme 
having in its natural state a plastid transit peptide, wherein the DNA molecule is modified 
such that the transit peptide either is absent or does not function to target the enzyme to the 
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plastid, wherein the DNA molecule is operatively linked to a promoter capable of expression 
in plant plastids. In a preferred embodiment, a DNA molecule of the present invention 
encodes an enzyme that is naturally inhibited by a herbicide. In another preferred 
embodiment, the DNA molecule encodes a protoporphyrinogen IX oxidase ("protox"). In a 
preferred embodiment, the protoporphyrinogen IX oxidase gene is from Arabidopsis 
thaliana and in a more preferred embodiment, the protoporphyrinogen IX oxidase gene is 
from Arabidopsis thaliana and comprises at least one amino acid substitution. Preferably, 
an amino acid substitution results in tolerance of the enzyme against inhibition by an 
herbicide which naturally inhibits the activity of the enzyme. Low concentrations of herbicide 
are thought to kill wildtype plants due to light-sensitive intemiediates which build up when 
the plastid-localized protox enzyme is inhibited! Production of these photosensitizing 
compounds does not require differentiated chloroplasts or active photosynthesis, which is a 
key factor for successful plastid transformation of plants whose regenerable cultured tissues 
are of non-photosynthetic nature. - 

Another key feature is to have expression of the selectable nriarker gene in non- 
green plastids. In a preferred embodiment, the invention encompasses the use of 
promoters that are capable of expression of operatively linked DNA molecules in plastids of 
both green and non-green tissue. In particular, one such, promoter comes from the 
regulatory region of the plastid 16S^ribosomal RNA operon. Another candidate is the 

-promoter and 5-UTR f rom the plastid cIpP gene. The cIpP gene product is expressed 
constitutively in plastids from ^all plant tissues, including those that do not contain 

'-chloroplasts (Shanklin (1995) P/anf Ge// 7: 1713-22). 

Other®NA^moleculesvmay^be^coVmtroduced in plant plastids using the method 
described above, jn a preferred^.embodiment, a plastid transformation vector of the present 
invention contains a chimeric gene allowing for selection of transfomnants as described 
above and at least one other gene fused to a promoter capable of expression in plant 
plastids. The other such gene may, for example, confer resistance to insect pests, or to 
fungal or bacterial pathogens, or may encode one or more value-added traits. 

EXAMPLES 

The invention will be further described by reference to the following detailed 
examples. These examples are provided for purposes of illustration only, and are not 
intended to be limiting unless othenwise specified. Standard recombinant DNA and 
molecular cloning techniques used here are well known in the art and are described by 
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Ausubel (ed.), Current Protocols in Molecular Biology, John Wiley and Sons, Inc. (1994); T. 
Maniatis, E. F. Fritsch and J. Sambrook, Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor laboratory, Cold Spring Harbor, NY (1989); and by TJ. Silhavy, M.L Berman, 
and L.W. Enqulst, Experiments with Gene Fusions, Cold Spring Harbor Laboratory, Cold 
Spring Harbor, NY (1984). 

Section A. Isolation And Characterization Of Plant 
Protoporphyrinogen Oxidase (Protox) Genes 

Example 1: Isolation of a Wheat Protox-1 cDNA Based on Sequence Homology to a Maize 

Protox-1 Coding Sequence 

Total RNA prepared from Triticum aestivum (cv Kanzler) was submitted to Clontech 
for custom cDNA library construction in the Lambda Uni-Zap vector. Approximately 50,000 
pfu of the cDNA library were plated at a density of approximately 5,000 pfu per 10 cm Petri 
dish and duplicate filter lifts were made onto nitrocellulose membranes (Schleicher and 
Schuell), The plaque lifts were probed with the maize protox-1 cDNA (SEQ ID NO:5; see 
Example 2 of International application no. PCT/IB95/00452, filed June 8, 1995, published 
Dec. 21, 1995 as WO 95/34659) labeled with 32P-dCTP by the random priming method 
(Life Technologies). Hybridization conditions were 7% sodium dodecyl sulfate (SDS), 0.5 M 
NaP04 pH 7.0, 1 mM EDTA at 50°C. Wash conditions were 2X SSC, 1% SDS at 50°C. 
(Church and Gilbert, Proc, NatL Acad ScL USA 8t; 1991-1995 (1984), hereby incorporated 
by reference in its entirety.) Positively hybridizing plaques were purified and in vivo excised 
into pBluescript plasmids. The sequences of the cDNA inserts were determined by the 
chain termination method using dideoxy terminators labeled with fluorescent dyes (Applied 
Biosystems, Inc.). The longest wheat protox-1 cDNA obtained from initial screening efforts, 
designated "wheat protox-1", was 1489-bp in length. Wheat protox-1 lacks coding 
sequence for the transit peptide plus approximately 126 amino acids of the mature coding 
sequence based on comparison with the other known plant protox peptide sequences. 

A second screen was performed to obtain a longer wheat protox cDNA. For this 
screen, a Triticum aestivum (cv Kanzler) cDNA library was prepared internally using the 
lambda Uni-Zap vector. Approximately 200,000 pfu of the cDNA library was screened as 
indicated above, except that the wheat protox-1 cDNA was used as a probe and 
hybridization and wash conditions were at 65°C instead of 50°C. The longest wheat cDNA 
obtained from this screening effort, designated "wheat protox-1 a", was 1811 -bp in length. 
The nucleotide sequence of this cDNA and the amino acid sequence it encodes are set 
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forth in SEQ ID N0s;9 and 10, respectively. Based on comparison with the other known 
plant protox peptide sequences and with corresponding genomic sequence, this cDNA is 
either full-length or missing only a few transit peptide codons (Table 1 A). This wheat 
protein sequence is 91% identical (95% similar) to the maize protox-1 protein sequence set 
forth in SEQ ID NO:6. 

Wheat protox-1 a, in the pBluescript SK vector, was deposited March 19, 1996, as 
pWDC-13 (NRRL #821545). 

Example 2: Isolation of a Soybean Protox-1 cDNA Based on Sequence Homology to an 

Arabidopsis Protox-1 Coding Sequence 
A Lambda Uni-Zap cDNA library prepared from soybean (v Williams 82, epicotyls) 
was purchased from Stratagene. Approximately 50,000 pfu of the library was plated at a 
density of approximately 5,000 pfu per 10 cm Petri dish and duplicate filter lifts were made 
onto Colony/Plaque Screen membranes (NEN Dupont). The plaque lifts were probed with 
the' >AraZ?/cfops/s protox-1 cDNA (SEQ ID N0:1; see Example 1 of International application 
no. PCT/IB95/00452, filed June 8, 1995, published Dec. 21, 1995 as WO 95/34659)) 
labeled with 32P-dCTP by the random priming method (Life Technologies). Hybridization 
conditions were 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04 pH 7.0, 1 mM EDTA at 
50° C. Wash conditions were 2X SSG, 1% SDS at 50° C, (Church and Gilbert (1984)). 
Positively hybridizing plaques were^purified and in vivo excised into pBluescript plasmids. 
- The sequence of the cDNA inserts was detemnined by the chain tennination method using 
■^dideoxy terminators labeled with fluorescent dyes (Applied Biosystems, Inc.), The longest 
:ti^soybean GDNA.obtained,.designated "soybean is full-length based on comparison 

^^r^with'the other known plant protox.peptide sequences (Table 1 A), Soybean protox-1 is 
' 1847-bp in length and encodes a protein of 58.8 kDa. The nucleotide sequence of this 
cDNA and the amino acid sequence it encodes are set forth in SEQ ID NOs:1 1 and 1 2, 
respectively. The soybean protein is 78% identical (87% similar) to the Arabidopsis protox- 
1 protein. 

Soybean protox-1, in the pBluescript SK vector, was deposited December 15, 1995 
as pWDC-1 2 (NRRL #B-21 51 6). 

Example 3: Isolation of a Cotton Protox-1 cDNA Based on Sequence Homology to a Maize 

Protox-1 Coding Sequence 
A Lambda Uni-Zap cDNA library prepared from Gossyp/u/r? hirsutum L (72 hr. dark 
grown cotyledons) was obtained from Dr. Dick Trelease, Dept. of Botany, Arizona State 
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University (Ni W. and Trelease R.N., Arch, Biochem, Biophys, 289: 237-243 (1991)). 
Approximately 50,000 pfu of the library was plated at a density of approximately 5,000 pfu 
per 10 cm Petri dish and duplicate filter lifts were made onto Colony/Plaque Screen 
membranes (NEN Dupont). The plaque lifts were probed with the maize protox-1 cDNA 
(SEQ ID N0:5) labeled with 32P-dCTP by the random priming method (Life Technologies). 
Hybridization conditions were 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04 pH 7.0, 1 
mM EDTA at 50° C. Wash conditions were 2X SSC, 1% SDS at 50° C. (Church and Gilbert 
(1984)). Positively hybridizing plaques were purified and in vivo excised into pBluescript 
plasmids. The sequence of the cDNA inserts was detemnined by the chain temnination 
method using dideoxy terminators labeled with fluorescent dyes (Applied Biosystems, Inc.). 
The longest cotton cDNA obtained, designated "cotton protox-l", appears to be full-length 
based on comparison with the other known plant protox peptide sequences (Table 1 A). 
Cotton protox-1 is 1 826-bp in length and encodes a protein of 58.2 kDa. The nucleotide 
sequence of this cDNA and the amino acid sequence it encodes are set forth in SEQ ID 
N0s:13 and 14, respectively. The cotton protein is 77% identical (86% similar) to the maize 
protox-1 protein. 

Cotton protox-1 , in the pBluescript SK vector, was deposited July 1 , 1996 as pWDC- 
15(NRRL #B-21594). 

Example 4: Isolation of a Sugar Beet Protox-1 cDNA Based on Sequence Homology to an 

Arabidopsis Protox-1 Coding Sequence 
A Lambda-Zap cDNA library prepared from Beta vulgaris was obtained from Dr. 
Philip Rea, Dept. of Botany, Plant Science Institute, Philadelphia, PA (Yongcheol Kim. 
Eugene J. Kim, and Philip A. Rea, Plant PhysioL 106: 375-382 (1 994)). Approximately 
50,000 pfu of the cDNA library were plated at a density of approximately 5,000 pfu per 10 
cm Petri dish and duplicate filter lifts were made onto nitrocellulose membranes (Schleicher 
and Schuell). The plaque lifts were probed with the Arabidopsis protox-1 cDNA (SEQ ID 
N0:1) labeled with 32P-dCTP by the random priming method (Life Technologies). 
Hybridization conditions were 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04 pH 7.0. 1 
mM EDTA at 50° C. Wash conditions were 2X SSC, 1% SDS at 50° C. (Church and Gilbert 
(1984)). Positively hybridizing plaques were purified and in vivo excised into pBluescript 
plasmids. The sequences of the cDNA inserts were detennined by the chain termination 
method using dideoxy terminators labeled with fluorescent dyes (Applied Biosystems, Inc.). 
The longest sugar beet protox-1 cDNA obtained, designated "sugar beet protox-l", is full- 
length based on comparison with the other known plant protox peptide sequences (Table 



69 



wo 01/68826 PCT/EPOl/02753 

1A). Sugar beet protox-1 is 1910-bp in length and encodes a protein of 60 kDa. The 
nucleotide sequence of this cDNA and the amino acid sequence It encodes are set forth in 
SEQ ID NOs:15 and 1 6, respectively. The sugar beet protein is 73% identical (82% similar) 
to the Arabidopsis protox-1 protein. 

Sugar beet protox-1, in the pBluescript SK vector, was deposited July 29, 1996, as 
pWDC-1 6 (NRRL #B-21 595N). 

Example 5: Isolation of an Oilseed Rape Protox-1 cDNA Based on Sequence Homology to 

an Arabidopsis Protox-1 Coding Sequence 
A Lambda Uni-Zap II cDNA library prepared from Brassica napus (3-4 wk. mature 
green leaves) was obtained from Dr. Guenther Ochs, Institut Fuer Allgemeine Botanik, 
Johannes Gutenberg-Universitaet Mainz, Germany (Gunther Ochs, Gerald Schock, and 
Aloysius Wild, Plant Physiol. A03: 303-304 (1993)). Approximately 50,000 pfu of the cDNA 
library were plated at a density of approximately 5,000 pfu per 10 cm Petri dish and 
duplicate filter lifts were made onto nitrocellulose membranes (Schleicher and Schuell). The 
plaque lifts were probed with the Arabidopsis protox-1 cDNA (SEQ ID N0:1) labeled with 
32P-dCTP by the random priming method (Life Technologies). Hybridization conditions 
were Wo sodium dodecyl sulfate (SDS), 0.5 M NaP04 pH 7.0, 1 mM EDTA at 50° C. Wash 
conditions were 2X SSC, 1% SDSat 50°'C. (Church and Gilbert (1984)). Positively 
> hybridizing plaques were purif ied^-and in ;vivo excised into, pBluescript plasmids. The 
sequences of the cDNA inserts were detemnined by the chain tennination method using 
dideoxy-terminators labeled with fluorescent dyes (Applied Biosystems, Inc.). The longest 
icioilseedfape protox-l^.cDNA obtained,^ designated "rape;protox-r, is full-length based on 
^.comparison with the. other known plant protox peptide sequences (Table 1 A). Rape protox- 
1 is 1784-bp In length and encodes a protein of 57.3kD. The nucleotide sequence of this 
cDNA and the amino acid sequence it encodes are set forth in SEQ ID NOs: 17 and 1 8, 
respectively. The oilseed rape protein is 87% identical (92% similar) to the Arabidopsis 
protox-1 protein. 

Rape protox-1, in the pBluescript SK vector, was deposited August 23, 1996, as 
pWDC-17 (NRRL #B-21615). 

Example 6: Isolation of a Rice Protox-1 cDNA Based on Sequence Homology to a Maize 

Protox-1 Coding Sequence 
A Lambda gtl 1 cDNA library prepared from Oryza sativa (5 day etiolated shoots) 
was purchased from Clontech. Approximately 50,000 pfu of the cDNA library were plated at 
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a density of approximately 5,000 pfu per 10 cm Petri dish and duplicate filter lifts were made 
onto nitrocellulose membranes (Schleicher and Schuell). The plaque lifts were probed with 
the maize protox-1 cDNA (SEQ ID N0:5) labeled with 32P-dCTP by the random priming 
method (Life Technologies). Hybridization conditions were 7% sodium dodecyl sulfate 
(SDS), 0.5 M NaP04 pH 7.0, 1 mM EDTA at 50° C. Wash conditions were 2X SSC, 1% 
SDS at 50° C. (Church and Gilbert (1984)). Positively hybridizing plaques were purified, 
and lambda DNA was prepared using the Wizard Lambda-Prep kit (Promega). The cDNA 
inserts were subcloned as EcoR/ fragments into the pBluescript SK vector using standard 
techniques. The sequences of the cDNA inserts were determined by the chain termination 
method using dideoxy terminators labeled with fluorescent dyes (Applied Biosystems, Inc.). 
The longest rice protox-1 cDNA obtained, designated "rice protox-1", was 1224-bp in length. 
Rice protox-1 lacks coding sequence for the transit peptide plus approximately 172 amino 
acids of the mature coding sequence based on comparison with the other known plant 
protox peptide sequences (Table 1 A). The nucleotide sequence of this partial cDNA and 
the amino acid sequence it encodes are set forth in SEQ ID N0s:19 and 20, respectively. 

Rice protox-1 , in the pBluescript SK vector, was deposited December 6, 1 996, as 
pWDC-18 (NRRL #B-21648). 

Example 7: Isolation of a Sorghum Protox-1 cDNA Based on Sequence Homology to a 

Maize Protox-1 Coding Sequence 
A Lambda-Zap II cDNA library prepared from Sorghum bicolor (3-6 day green 
seedlings) was obtained from Dr. Klaus Pfizenmaier, Institute of Cell Biology and 
Immunology, University of Stuttgart, Germany (Harald Wajant, Karl-Wolfgang Mundry, and 
Klaus Pfizenmaier, Plant Mol. Biol. 26: 735-746 (1994)). Approximately 50,000 pfu of the 
cDNA library were plated at a density of approximately 5.000 pfu per 1 0 cm Petri dish and 
duplicate filter lifts were made onto nitrocellulose membranes (Schleicher and Schuell). The 
plaque lifts were probed with the maize protox-1 cDNA (SEQ ID N0:5) labeled with 32P- 
dCTP by the random priming method (Life Technologies). Hybridization conditions were 7% 
sodium dodecyl sulfate (SDS), 0.5 M NaP04 pH 7.0, 1 mM EDTA at 50° C. Wash 
conditions were 2X SSC, 1% SDS at 50° C. (Church and Gilbert (1984)). Positively 
hybridizing plaques were purified and in vivo excised into pBluescript plasmids. The 
sequences of the cDNA inserts were determined by the chain termination method using 
dideoxy terminators labeled with fluorescent dyes (Applied Biosystems, Inc.). The longest 
sorghum protox-1 cDNA obtained, designated "sorghum protox-1", was 1590-bp in length. 
Sorghum protox-1 lacks coding sequence for the transit peptide plus approximately 44 
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amino acids of the mature coding sequence based on comparison with the other known 
plant protox peptide sequences (Table 1 A). The nucleotide sequence of this partial cDNA 
and the amino acid sequence it encodes are set forth in SEQ ID N0s:21 and 22, 
respectively. 

Sorghum protox-1, in the pBluescript SK vector, was deposited December 6, 1996, 
as pWDC-19 (NRRL #B-21649). 

Example 8: Isolation of a Sugar Cane Protox-1 cDNA Based on Sequence Homology to a 

Maize Protox-1 Coding Sequence 
A Lambda-Zap II cDNA library prepared from sugar cane was obtained from Henrik 
Albert of USDA/ARS at the Hawaii Agricultural Research Center. Approximately 50,000 pfu 
of the cDNA library were plated at a density of approximately 5,000 pfu per 10 cm Petri dish 
and duplicate filter lifts were made onto nitrocellulose membranes (Schleicher and Schuell). 
The plaque lifts were probed with the maize protox-1 cDNA (SEQ ID NO:5) labeled with 
>32P-dCTP by the random priming method (Life Technologies).: Hybridization conditions 
^were 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04 pH 7.0, 1 mM EDTA at 50° C. Wash 
- conditions were ^2XrSSC, 1 % SDS.at 50"" C. (Church and Gilbert (1 984)). Positively 
: hybridizing plaques.were purified and in vivo excised into pBluescript plasmids. The 
sequences of the cDNA inserts were determined by the chain terniination method using 
dideoxy terminators^labeled with fluorescent dyes (Applied.Biosystems, Inc.). The longest 
sugar cane protox-1 cDNA obtained, designated "sugar cane protox-r, was 633-bp in 
length. ' Sugar eane protox-1 lacks codingrsequence for the transit peptide plus 
approximately '382 amino acids of the mature coding sequence-based on comparison with 
theiOther known plant protox peptide sequences (Table 1 A). The nucleotide sequence of 
this partial cDNA and the amino acid sequence it encodes are set forth in SEQ ID NOs:36 
and 37, respectively. 

Example 9: Demonstration of Plant Protox Clone Sensitivity to Protox Inhibitory Herbicides 

in a Bacterial System 
Liquid cultures of protox-1 /SASX38, protox-2/SASX38 and pBluescript/XLl-Blue 
were grown in L amp^°*^. One hundred microliter aliquots of each culture were plated on L 
amp^^ media containing various concentrations (1.0nM-10mM) of a protox inhibitory 
aryluracil herbicide of formula XVII. Duplicate sets of plates were incubated for 18 hours at 
37** C. 
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The protox+ E. co// strain XL1-Blue showed no sensitivity to the herbicide at any 
concentration, consistent with reported resistance of the native bacterial enzyme to similar 
herbicides The protox-1/SASX38 was clearly sensitive, with the lawn of bacteria almost 
entirely eliminated by inhibitor concentrations as low as 10nM. The protox-2/SASX38 was 
also sensitive, but only at a higher concentration (10^M) of the herbicide. The herbicide was 
effective even on plates maintained almost entirely in the dark. The toxicity of the herb.cde 
was entirely eliminated by the addition of 20 pg/ml hematin to the plates. 

The different herbicide tolerance between the two plant protox strains is likely the 
result of differential expression from these two plasmids. rather than any inherent difference 
in enzyme sensitivity. Protox-1/SASX38 grows much more slowly than protox-2/SASX38 in 
any heme-deficient media. In addition, the Mzprotox-2/SASX38 strain, with a growth rate 
comparable to Arab protox.1/SASX38. is also very sensitive to herbicide at the lower (10- 
1 0OnM) concentrations. 

Section B: Identification and Characterization of Plant Protox Genes 

Resistant to Protox-lnhibitory Herbicides 

Example 10: Selecting for Plant Protox Genes Resistant to Protox-lnhibitory Herbicides in 

the E coli Expression System 
An Arabidopsis thaliana (Landsberg) cDNA library in the plasmid vector pFL61 
(Minet etah. Plant J. 2:417-422 (1992) was obtained and amplified. The £ coli hemG 
mutant SASX38 (Sasamian et a/.. J. Gen. Microbiol. 1 1 3:297(1 979)) was obtained and 
maintained on L media containing 20ug/ml hematin (United States Biochemicals). The 
plasmid library was transfomied into SASX38 by electroporation using the Bio-Rad Gene 
Pulser and the manufacturer's conditions. The electroporated cells were plated on L agar 
containing lOOug/ml ampicillin at a density of approximately 500.000 transformants/1 0cm 
plate The cells were then incubated at 37°C for 40 hours in low light and selected for the 
ability to grow without the addition of exogenous heme. Heme prototrophs were recovered 
at a frequency of 400/1o' from the pFL61 library. Sequence analysis of twenty-two 
complementing clones showed that nine are of the type designated "protox-1 the protox 
gene expected to express a chloroplastic protox enzyme. 

The pFL61 library is a yeast expression library, with the Arabidopsis cDNAs inserted 
bidirectionally. These cDNAs can also be expressed in bacteria. The protox cDNAs 
apparently initiate at an in-frame ATG in the yeast PGK 3' sequence approximately 10 
amino acids 5- to the NotI cloning site in the vector and are expressed either from the lacZ 
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promoter 300bp further upstream or from an undefined cryptic bacterial promoter. Because 
protox-1 cDNAs that included significant portions of a chloroplast transit sequence inhibited 
the growth of the E. co// SASX38 strain, the clone with the least amount of chloroplast 
transit sequence attached was chosen for mutagenesis/herbicide selection experiments. 
This clone, pSLV19, contains only 17 amino acids of the putative chloroplast transit peptide, 
with the DNA sequence beginning at bp-151 of the Arabidopsis protox-1 cDNA (SEQ ID 
N0:1). 

The plasmid pSLV19 was transfomned into the random mutagenesis strain XL1-Red 
(Stratagene, La Jolla, CA), The transformation was plated on L media containing 50ug/ml 
ampicillin and incubated for 48 hours at 37°C. Lawns of transformed cells were scraped 
from the plates and plasmid DNA prepared using the Wizard Megaprep kit (Promega, 
Madison, Wl). Plasmid DNA isolated from this mutator strain is predicted to contain 
approximately one random base change per 2000 nucleotides (see Greener ef a/., 
Sfrafegf/es7(2):32-34 (1994). 

The mutated plasmid DNA was transfonned into the hemG mutant SASX38 
(Sasarman et a/., J. Gen. MicrobioL 1 13:297 (1 979) and plated on L media containing 
various concentrations of protox-inhibiting herbicide (formula XVII). The plates were 
incubated for 2 'days^at-37°C. Plasmid DNA was isolated from all colonies that grew in the 
• presence of herbicide concentrations that effectively killed the wild type strain. The isolated 
DNA. was^then^ransformed into SASX38 and plated again oh herbicide to ensure that the 
resistance observed was plasmid-bome. The protox coding sequence from plasmids 
passing this^screen' was excised by A/of/ digestion, recloned into an unmutagenized vector, 
*and tested: again fdpthe ability to*confer herbicide tolerancei The DNA sequence of protox 
>.cDNAs.ithat conferred herbicide resistance was' then determined and mutations identified by 
- comparison with the wild type Arabidopsis protox-1 sequence (SEQ ID N0:1). 

A single coding sequence mutant was recovered from the first mutagenesis 
experiment This mutant leads to enhanced heriDicide "resistance" only by increasing 
growth rate. It contains a C to A mutation at nucleotide 197 in SEQ ID NO:1 in the 
truncated chloroplast transit sequence of pSLV19, converting an ACG codon for threonine 
to an AAG codon for lysine at amino acid 56 of SEQ ID NO:2, and resulting in better 
complementation of the bacterial mutant. This plasmid also contains a silent coding 
sequence mutation at nucleotide 1059, with AGT (Ser) changing to AGO (Ser). This 
plasmid was designated pMut-1 . 

The pMut-1 plasmid was then transformed into the mutator XLI-Red strain as 
described above and the mutated DNA was isolated and plated on an heriDicide 
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concentration that is lethal to the unmutagenized pMut-1 protox gene. Herbicide tolerant 
colonies were isolated after two days at a/c and analyzed as described above. Multiple 
plasmids were shown to contain herbicide resistant protox coding sequences. Sequence 
analysis indicated that the resistant genes fell into two classes. One resistance mutation 
identified was a C to T change at nucleotide 689 in the Arabidopsis protox-1 sequence set 
forth in SEQ ID NO:1 . This change converts a GCT codon for alanine at amino acid 220 of 
SEQ ID N0:2 to a GTT codon for valine, and was designated pAraC-1 Val (see, Table 1 B; 
sub-sequence 3). 

A second class of herbicide resistant mutant contains an A to G change at 
nucleotide 1 307 in the Arabidopsis protox-1 sequence. This change converts a TAG codon 
for tyrosine at amino acid 426 to a TGG codon for cysteine, and was designated pAraC- 
2Cys (see, Table 1 B; sub-sequence 7). 

A third resistant mutant has a G to A change at nucleotide 691 in the Arabidopsis 
protox-1 sequence. This mutation converts a GGT codon for glycine at amino acid 221 to 
an AGT codon for serine at the codon position adjacent to the mutation in pAraC-1 . This 
plasmid was designated pAraC-3Ser {see, Table 1B; sub-sequence 4). 

Resistant mutant pAraC-2Cys, in the pMut-1 plasmid. was deposited on November 
14, 1994 under the designation pWDG-7 with the Agricultural Research Culture Collection 
and given the deposit designation NRRL #21 339N. 

Example 1 1 : Additional Herbicide-Resistant Codon Substitutions at Positions Identified in 

the Random Screen 

The amino acids identified as herbicide resistance sites in the random screen are 
replaced by other amino acids and tested for function and for herbicide tolerance in the 
bacterial system. Oligonuclebtide-directed mutagenesis of the Arabidopsis protox-1 
sequence is perfomied using the Transformer Site-Directed Mutagenesis Kit (Clontech. 
Palo Alto, CA). After amino acid changes are confirmed by sequence analysis, the mutated 
plasmids are transformed into SASX38 and plated on L-amp^» media to test for function 
and on various concentrations of protox-inhibiting herbicide to test for tolerance. 

This procedure is applied to the alanine codon at nucleotides 688-690 and to the 
tyrosine codon at nucleotides 1 306-1 308 of the Arabidopsis protox-1 sequence (SEQ ID 
N0:1). The results demonstrate that the alanine codon at nucleotides 688-690 can be 
changed to a codon for valine (pAraC-IVal). threonine (pAraC-IThr), leucine (pAraC-ILeu). 
cysteine (pAraC-ICys). or isoleucine (pAraC-llle) to yield an herbicide-resistant protox 
enzyme that retains function [see, Table 1 B; sub-sequence 3). The results further 
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demonstrate that the tyrosine codon at nucleotides 1306-1308 can be changed to a codon 
for cysteine (pAraC-2Cys). isoleucine (pAraC-2lle), leucine (pAraC-2Leu), threonine (pAraC- 
2Thr), methionine (pAraC-2Met), valine (pAraC-2Val), or alanine (pAraC-2Ala) to yield an 
herbicide-resistant protox enzyme that retains function (see. Table 1 B; sub-sequence 7). 

Example 12: Isolation of Additional Mutations that Increase Enzyme Function and/or 
Herbicide Tolerance of Previously Identified Resistant Mutants 
Plasmids containing herbicide resistant protox genes are transfomried into the 
mutator strain XLI-Red and mutated DNA is isolated as described above. The mutated 
plasmids are transfomned into SASX38 and the transfomiants are screened on herbicide 
concentrations (fomaula XVII) sufficient to inhibit growth of the original "resistant" mutant. 
Tolerant colonies are isolated and the higher tolerance phenotype is verified as being 
coding sequence dependent as described above. The sequence of these mutants is 
detenmined and mutations identified by comparison to the progenitor sequence. 

This procedure was applied to the pAraC-IVal mutant described above. The results 
demonstrate that the serine codon at amino acid 305 (SEQ ID NO:2) can be changed to a 
codon.for leucine to yield an enzyme with higher tolerance to protox-inhibiting herbicides 

r*than the.pAraC-1Val mutant alone. This second site mutation is designated AraC305Leu 
-(see. Table 1 B; sub-sequence 13). The same results are demonstrated for the threonine 

-- codon at amino acid 249, where a change to either isoleucine or to alanine leads to a more 
tolerant enzyme (see. Table IB; sub"Sequence^12). These changes are designated 
AraG249lle and AraC249Ala, respectively. - 

"The procedure was also-applied to the pAraC-2Cys mutant described above. The 
results.demonstrate that the proline codon at amino acid 11 8 (SEQ ID N0:2) can be 
changed to a codon for leucine to yield an enzyme with higher tolerance to protox-inhibiting 
herbicides than the pAraC-ICys mutant alone. This mutation is designated AraCI 18Leu 
(see, Table 1 B; sub-sequence 11). The same results are demonstrated for the serine 
codon at amino acid 305, where a change to leucine leads to a more tolerant pAraC-2Cys 
enzyme (see, Table IB; sub-sequence 13). This change was also isolated with the pAraC- 
1 Val mutant as described above and is designated AraC305Leu. Additional mutations that 
enhance the herbicide resistance of the pAraC-2Cys mutant include an asparagine to 
serine change at amino acid 425, designated AraC425Ser (Table IB; sub-sequence 14), 
and a tyrosine to cysteine at amino acid 498, designated AraC498Cys (Table IB; sub- 
sequence 15). 
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These changes (Table 1B; sub-sequences 11-15) are referred to as "second site- 
mutations, because they are not sufficient to confer herbicide tolerance alone, but rather 
enhance the function and/or the herbicide tolerance of an already mutant enzyme. This 
does not preclude the possibility that other amino acid substitutions at these sites could 
suffice to produce an herbicide tolerant enzyme since exhaustive testing of all possible 
replacements has not been perfomned. 

Example 13: Combining Identified Resistance Mutations with Identified Second Site 
Mutations to Create Highly Functional/Highly Tolerant Protox Enzymes 
The AraC305Leu mutation described above was found to enhance the 
function/herbicide resistance of both the AraC-1 Val and the AraC-2Cys mutant plasmids. In 
an effort to test the general usefulness of this second site mutation, it was combined with 
the AraC-2Leu, AraC-2Val, and AraC-2lle mutations and tested for herbicide tolerance. In 
each case, the AraC305Leu change significantly increased the growth rate of the resistant 
protox mutant on protox-inhibiting herbicide. Combinations of the AraC-2lle resistant 
mutant with either the second site mutant AraC249ile or AraC1 1 8Leu also produced more 
highly tolerant mutant protox enzymes. The AraC2491le mutation demonstrates that a 
second site mutation identified as enhancing an AraC-1 (sub-sequence 3) mutant may also 
increase the resistance of an AraC-2 (sub-sequence 7) mutant. A three mutation plasmid 
containing AraG-2lle. AraC305Leu. and AraC249lle (Table 1 B; sub-sequences 7, 13, and 
12) has also been shown to produce a highly functional, highly herbicide tolerant protox-1 

enzyme. 

Example 14: Identification of Sites in the Maize Protox-1 Gene that Can Be Mutated to Give 

Herbicide Tolerance 

The pMut-1 Arabidopsis protox -1 plasmid described above is very effective when 
used in mutagenesis/screening experiments in that it gives a high frequency of genuine 
coding sequence mutants, as opposed to the frequent up-promoter mutants that are 
isolated when other plasmids are used. In an effort to create an efficient plasmid screening 
system for maize protox-1 , the maize cDNA was engineered into the pMut-1 vector in 
approximately the same sequence context as the Arabidopsis cDNA. Using standard 
methods of overlapping PCR fusion, the 5' end of the pMut-1 Arabidopsis clone (including 
17 amino acids of chloroplast transit peptide with one mis-sense mutation as described 
above) was fused to the maize protox-1 cDNA sequence starting at amino acid number 14 
of the maize sequence (SEQ ID N0:6). The 3' end of the maize cDNA was unchanged. 
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NotI restriction sites were placed on both ends of this fusion, and the chimeric gene was 
cloned into the pFL61 plasmid backbone from pMut-1 . Sequence analysis revealed a single 
nucleotide PCR-derived silent mutation that converts the ACG codon at nucleotides 745- 
747 (SEQ ID NO:5) to an ACT codon, both of which encode threonine. This chimeric Arab- 
maize protox-1 plasmid was designated pMut-3. 

The pMut-3 plasmid was transfomned into the mutator XL1-Red strain as described 
above and the mutated DNA was isolated and plated on a herbicide concentration (formula 
XVII) that was lethal to the unmutagenized pMut-3 maize protox gene. Herbicide tolerant 
colonies were isolated after two days at 37°C and analyzed as described above. This 
analysis revealed multiple plasmids containing herbicide resistant protox coding sequences. 
Sequence analysis showed 5 single base changes that individually result in an herbicide 
tolerant maize protox-1 enzyme. Three of these mutations correspond to amino acid 
changes previously shown to confer tolerance at the homologous position in the 
Arabidopsis protox-1 gene. Two of the three are pMzC-1 Val and pMzC-1Thr. converting the 
alanine (GCT) at amino acid 164 (SEQ ID N0:6) to either valine (GAT) or to threonine 
(ACT). This position corresponds to the pAraC-1 mutations described above (see, Table 
:1B; sub-sequence 3). The third analogous change, pMzC-3Ser, converts the.glycine (GGT) 
• at amino acid 165. to Serine (AGT), corresponding to the AraC-3Ser mutation described 
'"above.(see. Table. I B; sub-sequence 4). These results serve to validate the expectation 
^that^herbicide-tolerant mutationsHdentified in one plant protox gene may also confer 
herbicide^tolerance in an equivalent plant protox gene from another species. 

Two^df the-mutations isolated from' the maize protox-1 screen result in amino acid 
changes^at'residues^-not previously^ldehtifiedias herbicide resistance sites. 'One change 
(Mz159Bhe) converts cysteine (TGC). to phenylalanine (TTC) at amino acid 159 of the 
maize protox-1 sequence (SEQ ID N0:6) (see, Table IB; sub-sequence 2). The second 
(Mz419Thr) converts isoieucine (ATA) to threonine (ACA) at amino acid 419 (see. Table IB; 
sub-sequence 9). 

Additional amino acid substitutions were made and tested at three of the maize 
mutant sites. Tolerance was demonstrated when glycine 165 was changed to leucine 
(pMzC-SLeu) or when cysteine 159 was changed to either leucine (Mz159Leu) or to lysine 
(Mz159Lys) (see. Table 1 B; sub-sequences 4 and 2). Tolerant enzymes were also created 
by changing isoieucine 419 to histidine (Mz419His), glycine (Mz419Gly), or asparagine 
(Mz419Asn) {see, Table IB; sub-sequence 9). 

Individual amino acid changes that produced highly herbicide tolerant Arabidopsis 
protox-1 enzymes were engineered into the maize protox-1 gene by site-directed 
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mutagenesis as described above. Bacterial testing demonstrated that changing the alanine 
(GCT) at amino acid 164 (SEQ ID N0:6) to leucine (CTT) produced a highly tolerant maize 
enzyme (pMzC-ILeu) (see, Table 1B; sub-sequence 3). No mutation analogous to the 
AraC-2 site (Table 18; sub-sequence 7) in Arabidopsis ^Nas isolated in the maize random 
screen. However, changing this site, tyrosine 370 in the maize enzyme (SEQ ID NO:6), to 
either isoleucine (pMzC-2lle) or methionine (pMzC-2Met) did produce herbicide tolerant 
enzymes (see. Table IB; sub-sequence 7). 

Additional mutant screens perfomned as described eariier in this example, except 
using formulas XXIlla and XXlIlb instead of XVII. Identified three additional amino acid 
changes that confer tolerant protox enzymes. One, using formula XXIIIb, demonstrated that 
changing the arginine (CGT) at amino acid 88 (SEQ ID NO:6) to cysteine (TGT) produced a 
highly tolerant maize enzyme (Mz88Cys) (see, Table 1B; sub-sequence 1). Another, using 
formula IVc. demonstrated that changing the arginine (CGT) at amino acid 88 (SEQ ID 
NO:6) to leucine (CTT) produced a highly tolerant maize enzyme (Mz88Cys) (see. Table 1 B; 
sub-sequence 1). Another, using formula XXIlla, demonstrated that changing both the 
leucine (TTA) at amino acid 347 (SEQ ID NO:6) to serine (TCA) and the alanine (GCA) at 
amino acid 453 (SEQ ID N0:6) to threonine (ACA) produced a highly tolerant maize 
enzyme (Mz347Ser453Thr) (see. Table IB; sub-sequences 16 and 17). Unlike the second 
site mutations described above, which increase enzyme function and/or heriDicide tolerance 
of previously identified resistant mutants. Mz347Ser453Thr is a "double mutanf that 
requires that both mutations be present for herbicide tolerance. 

Additional mutant screens perfomned as described eariier in this example identified 
two additional amino acid changes that confer tolerant protox enzymes. One. using fomnula 
XXlVv demonstrated that changing the alanine (GCA) at amino acid 175 (SEQ ID NO:6) to 
valine (GTA) or to threonine (ACA) produced a highly tolerant maize enzyme (Mz175Val 
and Mz175Thr, respectively) {see. Table IB; sub-sequence 18). Another, using formula IVc. 
demonstrated that changing the leucine (TTG) at amino acid 337 (SEQ ID N0:6) to serine 
(TCG) produced a highly tolerant maize enzyme (Mz337Ser) (see, Table 1B; sub-sequence 

19). 

Example 15: Identification of Sites in the Wheat Protox-1 Gene that can be Mutated to Give 

Herbicide Tolerance 

To create an efficient plasmid screening system for wheat protox-1, the wheat cDNA 
was engineered into the pMut-1 vector as described above for the maize cDNA. This 
chimeric Arab-wheat protox-1 plasmid is designated pMut-4. The pMut-4 DNA was 
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mutated and screened for herbicide tolerance as described above. This analysis revealed 
multiple plasmids containing herbicide resistant protox coding sequences. Sequence 
analysis showed 7 single base changes that individually result in an herbicide tolerant 
wheat protox-1 enzyme. Four of these mutations correspond to amino acid changes 
previously shown to confer tolerance at the homologous position in the Arabidopsis an6/or 
in the maize protox-1 gene. Two, pWhtC-1 Val and pWhtC-1Thr, convert the alanine (GCT) 
at amino acid 21 1 (SEQ ID NO:10) to valine (GAT) and to threonine (ACT), respectively. 
This position corresponds to the pAraC-1 mutations described above (see, Table IB; sub- 
sequence 3). The third analogous change, pWhtG-3Ser, converts the glycine (GGT) at 
amino acid 212 to serine (AGT), conresponding to the AraC-3Ser mutation described above 
{see, Table IB; sub-sequence 4). The fourth. Wht466Thr, converts isoleucine (ATA) to 
threonine (AGA) at amino acid 466, con-esponding to the Mz419Thr mutant from maize 
(see. Table 1 B; sub-sequence 9). 

Three of the mutations isolated from the wheat protox-1 screen result in amino acid 
changes at residues not previously identified as herbicide resistance sites. One change 
(Wht356Leu) converts valine (GTT) to leucine (CTT) at amino-acid 356 of the wheat protox- 
1 sequence (SEQ ID NO:tO) {see, Table I B; sub-sequence 6). A second (Wht421 Pro) 
converts serine (TCT) to proline (CCT) at amino acid 421 (see. Table 1B; sub-sequence 8). 
The third (Wht502Ala) converts valine (GTT) to alanine (GCT) at amino acid 502 (see, 
Table-i'lBi^ub-sequence 10). , . , : . 

Example 16: - Identification of Sites in the Soybean Protox-1 Gene that can be Mutated to 

Give Herbicide Tolerance • . 
^To create aniefficient plasmid screening system for soybean protox-1 , the soybean 
cDNA was engineered into the pMut-1 vector as described above for the maize cDNA. This 
chimeric Arab-soybean protox-1 plasmid is designated pMut-5. The pMut-5 DNA was 
mutated and screened for herbicide tolerance as described above. This analysis revealed 
multiple plasmids containing herbicide resistant protox coding sequences. Sequence 
analysis showed 4 single base changes that individually result in an herbicide tolerant 
soybean protox-1 enzyme. Two of these mutations correspond to amino acid changes 
previously shown to confer tolerance at the homologous position in the Arabidopsis and/or 
in the wheat protox-1 gene. One, pSoyC-IThr, converts the alanine (GCA) at amino acid 
226 (SEQ ID NO:12) to threonine (ACA). This position corresponds to the pAraC-IThr 
mutation described above {see, Table IB; sub-sequence 3). The second analogous 
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Change, Soy517Ala, converts the valine (GTT) at amino acid 517 to alanine (GCT), 
corresponding to the Wht502Ala mutation Irom wheat (see. Table 1B; sub-sequence 10). 

Two of the mutations isolated from the soybean protox-1 screen result in amino acid 
changes at a residue not previously identified as an heiticide resistance site. One change 
(Soy369Ser) converts proline (CCT) to serine (TCT) at amino acid 369 of the soybean 
protox-1 sequence (SEQ ID NO:12) (see. Table IB; sub-sequence 5). A second 
(Soy369His) converts this same proline369 to histidine (CAT) (see, Table IB; sub-sequence 

5). 

Individual amino acid changes that produced highly herbicide tolerant Arabidopsis 
protox-1 enzymes were engineered into the soybean protox-1 gene by site directed 
mutagenesis as described above. Bacterial testing demonstrated that changing the alanine 
(GCA) at amino acid 226 (SEQ ID N0:12) to leucine (pSoyC-1Leu) produced a tolerant 
soybean enzyme (see. Table 1 B; sub-sequence 3). Changing the tyrosine (T AC) at amino 
acid 432 (SEQ ID N0:12) to either leucine (pSoyG-2Leu) or isoleucine (pSoyC-2lle) also 
produced herbicide tolerant enzymes (see, Table 1 B; sub-sequence 7). 

Example 17: Identification of Sites in the Sugar Beet Protox-1 Gene that can be Mutated to 

Give Herbicide Tolerance 
To create an efficient plasmid screening system for sugar beet protox-1 , the sugar 
beet cDNA was engineered into the pMut-1 vector as described above for the maize cDNA. 
This chimeric Arab-sugar beet protox-1 plasmid is designated pMut-6. The pMut-6 DNA 
was mutated and screened for herbicide tolerance as described above. This analysis 
revealed multiple plasmids containing heriDicide resistant protox coding sequences. 
Sequence analysis showed a single base change that results in an herbicide tolerant sugar 
beet protox-1 enzyme. This change (pSugC-2Cys) converts tyrosine (TAC) at amino acid 
449 to cysteine (TGC) and is analogous to the AraC-2 mutations in Arabidopsis (see, Table 

IB; sub-sequence 7). 

Individual amino acid changes that produced highly heriDicide tolerant Arabidopsis 
protox-1 enzymes were engineered into the sugar beet protox-1 gene by site directed 
mutagenesis as described above. Bacterial testing demonstrated that changing the 
tyrosine (TAC) at amino acid 449 to leucine (pSugC-2Leu). isoleucine (pSugC-2IIe), valine 
(pSugC-2Val), or methionine (pSugC-2Met) produced hertsicide tolerant sugar beet 
enzymes (see, Table 1 B; sub-sequence 7). 
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Example 18: Identification of Sites in the Cotton Protox-1 Gene that can be Mutated to Give 

Herbicide Tolerance 

in an effort to create an efficient plasmid screening system for cotton protox-1, the 
cotton cDNA was engineered into the pMut-1 vector as described above for the maize 
cDNA. This chimeric Arab-cotton protox-1 plasmid is designated pMut-7. The pMut-7 DNA 
was mutated and screened for herbicide tolerance as described above. This analysis 
revealed multiple plasmids containing herbicide resistant protox coding sequences. 
Sequence analysis showed 3 single base changes that individually result in an herbicide 
tolerant cotton protox-1 enzyme. Three mutants. pCotC-2Cys, pCotC-2His and pCotC- 
2Arg, change tyrosine (TAG) at amino acid 428 (SEQ ID N0:16) to cysteine (TGC), histidine 
(CAC) and to arginine (CGC). respectively {see, Table 1 B; sub-sequence 7). Arginine is a 
novel substitution giving tolerance at this previously identified AraC-2 (sub-sequence 7) site. 
The third mutation (Cot365Ser) converts proline (CCT) to serine (TCT) at amino acid 365. 
This change corresponds to the soybean mutant Soy369Ser (see, Table IB; sub-sequence 
5). . 

i Example 1 9: Demonstration ot Resistant Mutations' Cross-Tolerance to Various Protox- 

Inhibiting Compounds 
Resistant mutant plasmids.-originally identified-based on resistance against a single 
^protox^^inhibitory herbicide, were tested against a spectaim of other protox inhibiting 
compounds. For this test, the SASX38 strain containing the wild-type plasmid is plated on a 
trangc' of ^concentrations of each compound to detenmine the lethal concentration for each 
:i^ne:v''Resistant mutant plasrhids in: SASX38 are plated* and- scored for the ability to sun^ive 
..^on.a concentration.of each compound at least 10 fold higher than the concentration that is 
-lethal to the SASX38 strain containing the wild-type plasmid. 

Results from bacterial cross-tolerance testing, illustrated in Tables 3A and 3B. show 
that each of the mutations identified confers tolerance to a variety of protox inhibiting 
compounds. 

Section C: Expression of Herbicide-Resistant Protox Genes in Transgenic Plants 

Example 20: Engineering of Plants Tolerant to Protox-lnhibiting HertDicides by Homologous 

Recombination or Gene Conversion 
Because the described mutant coding sequences effectively confer herbicide 
tolerance when expressed under the control of the native protox promoter, targeted 
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Changes to the protox coding sequence in its native chromosomal location represent an 
alternative means for generating herbicide tolerant plants and plant cells. A fragment of 
protox DNA containing the desired mutations, but lacking its own expression signals (either 
promoter or 3' untranslated region) can be introduced by any of several art-recognized 
methods (for instance, Agrobacterium transformation, direct gene transfer to protoplasts, 
microprojectile bombardment), and herbicide-tolerant transformants selected. The 
introduced DNA fragment also contains a diagnostic restriction enzyme site or other 
sequence polymorphism that is introduced by site-directed mutagenesis in vitro without 
changing the encoded amino acid sequence (i.e. a silent mutation). As has been previously 
reported for various selectable marker and herbicide tolerance genes (see, e.g., 
Paszkowski etal., EMBOJ. 7: 4021-4026 (1988); Lee etal., Plant Cell 2: 415-425 (1990); 
Risseeuw et ai. Plant J. 7: 109-1 1 9 (1995)). some transformants are found to result from 
homologous integration of the mutant DNA into the protox chromosomal locus, or from 
conversion of the native protox chromosomal sequence to the introduced mutant sequence. 
These transformants are recognized by the combination of their herbicide-tolerant 
phenotype, and the presence of the diagnostic restriction enzyme site in their protox 
chromosomal locus. 

Example 21 : Construction of Plant Transformation Vectors 
Numerous transfomiation vectors are available for plant transfonmation, and the 
genes of this invention can be used in conjunction with any such vectors. The selection of 
vector for use will depend upon the preferred transformation technique and the target 
species for transformation. For certain target species, different antibiotic or herbicide 
selection markers may be prefened. Selection markers used routinely in transfomiation 
include the nptll gene, which confers resistance to kanamycin and related antibiotics 
(Messing & Vierra, Gene 19: 259-268 (1982); Bevan etal., Nature 304:184-187 (1983)), the 
bar gene, which confers resistance to the herbicide phosphinothricin (White et al., NucI 
Acids Res 18: 1 062 (1 990),, Spencer et al. Theor AppI Genet 79: 625-631 (1 990)), the hph 
gene, which confers resistance to the antibiotic hygromycin (Blochinger & Diggelmann, Mol 
Ce//e/o/ 4; 2929-2931), and the offtfrgene, which confers resistance to methotrexate 
(Bourouis et al., EMBO J. 2(7): 1099-1 104 (1983)). 

I. Constmction of Vectors Suitable for Agrobacterium Transformation 

Many vectors are available for transformation using Agrobacterium tumefaclens. 
These typically carry at least one T-DNA border sequence and include vectors such as 
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pBIN1 9 (Bevan, NucL Acids Res, (1 984)) and pXYZ. Below the construction of two typical 
vectors is described. 

Construction of pCIB200 and pCIB2001: The binary vectors pCIB200 and pCIB2001 
are used for the construction of recombinant vectors for use with Agrobacterium and was 
constructed in the following nnanner. pTJS75kan was created by Narl digestion of pTJS75 
(Schmidhauser & Helinski, J BacterioL 164: 446-455 (1985)) allowing excision of the 
tetracycline-resistance gene, followed by insertion of an >4cc/ fragment from pUC4K carrying 
an NPTII (Messing & Vierra, Gene 19: 259-268 (1982); Bevan efa/.. Nature 304: 184-187 
(1 983); McBride et aL, Plant Molecular Biology 14: 266-276 (1 990)). Xhol linkers were 
ligated to the Ecofll/ fragment of pCIB7, which contains the left and right T-DNA borders, a 
plant selectable nos/nptll chimeric gene and the pUC polylinker (Rothstein et aL, Gene 53: 
153-161 (1987)), and the X/?o/-digested fragment was cloned into Sa//-digested pTJS75kan 
to create pCIB200 (see also EP 0 332 104, example 19). pCIB200 contains the following 
unique polylinker restriction sites: EcoRI, Sstl, Kpnl, Bglll, Xbal, and SalL pCIB2001 is a 
derivative of pCIB200, which is created by the insertion into the polylinker of additional 
restriction sites. Unique restriction sites In the polylinker of pCIB2001 are EcoRI, SstI, Kpnl, 
Bglll, Xbal, Sail, tMlul, Boll, Avrll, Apal. Npal,^ arid Stul. pClB2001 , in addition to containing 
these unique restriction sites also has plant and bacterial kanamycin selection, left and right 
T-DNA borders for-Agrrodacfenum-mediafed transfomnation, the RK2-derived trfA function 
for mobilization between E. c&//and other hosts, and the O/fTand OnVfunctions also from 
RK2; The pCIB2001> polylinker is suitable for the cloning of plant expression cassettes 
^^Gontainingtheir-^owmregulatbry signals. 

* Construetion-fof pSIBI 0 and Hygromycirii Selection Derivatives thereof: The binary 
v^ector pCIB10:Contains a gene encoding kanamycin resistance for selection in plants, T- 
-DNA right and left border sequences and incorporates sequences from the wide host-range 
plasmid pRK252 allowing it to replicate in both E coll and Agrobacterium, Its constmction is 
described by Rothstein et aL, Gene 53: 153-161 (1987). Various derivatives of pCIBIO 
have been constmcted that incorporate the gene for hygromycin B phosphotransferase 
described by Gritz et aL, Gene 25: 179-188 (1983)). These derivatives enable selection of 
transgenic plant cells on hygromycin only (pCIB743), or hygromycin and kanamycin 
(PCIB715, PCIB717). 

11. Construction of Vectors Suitable for non-Agrobacterium Transformation. 

Transformation without the use of Agrobacterium tumefaciens circumvents the 
requirement for T-DNA sequences in the chosen transfomnation vector and consequently 
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vectors lacking these sequences can be utilized in addition to vectors such as the ones 
described above that contain T-DNA sequences. Transformation techniques that do not 
rely on Agrobacterium include transfomiation via particle bombardment, protoplast uptake 
{e.g. PEG and electroporation) and microinjection. The choice of vector depends largely on 
the preferred selection for the species being transformed. Below, the construction of some 

typical vectors Is described. 

Construction of pCIB3064: pCIB3064 is a pUC-derived vector suitable for direct 
gene transfer techniques in combination with selection by the herbicide basta (or 
phosphinothricin). The plasmid pCIB246 comprises the CaMV 35S promoter in operational 
fusion to the E. coli GUS gene and the CaMV 35S transcriptional terminator and is 
described in the PCT published application WO 93/07278. The 35S promoter of this vector 
contains two ATG sequences 5' of the start site. These sites were mutated using standard 
PGR techniques in such a way as to remove the ATG's and generate the restriction sites 
Sspl and Pvull. The new restriction sites were 96 and 37-bp away from the unique Sail site 
and 101 and 42-bp away from the actual start site. The resultant derivative of pCIB246 was 
designated pCIB3025. The GUS gene was then excised from pCIB3025 by digestion with 
Sail and Sad, the termini rendered blunt and religated to generate plasmid pCIB3060. The 
plasmid pJIT82 was obtained from the John Innes Centre, Nonwich and the a 400-bp Smal 
fragment containing the bar gene from Streptomyces viridochromogenes was excised and 
inserted into the Hpa/ site of pCIB3060 (Thompson etal. EMBO J 6: 2519-2523 (1987)). 
This generated pCIB3064, which comprises the tiargene under the control of the CaMV 
35S promoter and terminator for heriDicide selection, a gene for ampicillin resistance (for 
selection in E. coll) and a polylinker with the unique sites SphI, PstI, HIndlll, and BamHI. 
This vector is suitable for the cloning of plant expression cassettes containing their own 
regulatory signals. 

Construction of pS0G19 and pSOG35: pSOG35 is a transformation vector that 
utilizes the E. co// gene dihydrofolate reductase (DHFR) as a selectable mari<er conferring 
resistance to methotrexate. PCR was used to amplify the 35S promoter (-800-bp), intron 6 
from the maize Adhi gene (~550-bp) and 18-bp of the GUS untranslated leader sequence 
from pSOGIO. A 250-bp fragment encoding the E. co// dihydrofolate reductase type II gene 
was also amplified by PCR and these two PCR fragments were assembled with a Sacl-PstI 
fragment from pBI221 (Clontech), which comprised the pUCI 9 vector backbone and the 
nopaline synthase terminator. Assembly of these fragments generated pS0G19, which 
contains the 35S promoter in fusion with the intron 6 sequence, the GUS leader, the DHFR 
gene and the nopaline synthase terminator. Replacement of the GUS leader in pS0G19 
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with the leader sequence from Maize Chlorotic Mottle Virus (MCMV) generated the vector 
pSOG35. pSOG19 and pSOG35 carry the pUC gene for ampicillin resistance and have 
Hindlll, Sphlj PstI and EcoRI sites available for the cloning of foreign sequences. 

Example 22: Construction of Plant Expression Cassettes 
Gene sequences intended for expression in transgenic plants are firstly assembled 
in expression cassettes behind a suitable promoter and upstream of a suitable transcription 
terminator. These expression cassettes can then be easily transferred to the plant 
transformation vectors described above in Example 21 . 

I. Promoter Selection 

The selection of a promoter used in expression cassettes will determine the spatial 
and temporal expression pattem of the transgene in the transgenic plant. Selected 
promoters will express transgenes in specific cell types (such as leaf epidermal cells, 
mesophyir cells, root cortex cells) or in specific tissues or organs (roots,, leaves or flowers, 
for example) and this selection will reflect the desired location of expression of the 
transgene. * Alternatively, theselected promoter may driye expression of the gene under a 
lightrinduced or other,' temporally regulated promoter, A further alternative is that the 
selected promoter be chemically regulated. This would provide the possibility of inducing 

v^expression of thejransgene only when desired^^a caused by treatment with a chemical 
inducer. 

rUlSlFIFranscrtpti^ 

A variety of transcriptional temiinators are available fpr use In expression cassettes. 
';.rThese are responsible for the termination of transcription beyond the transgene and its 
correct polyadenylation. Appropriate transcriptional tenrninators arp those that are known to 
function in plants and include the CaMV 35S terminator, the fm/temninator, the nopaline 
synthase terminator, the pea rbcS E9 tenninator, as well as terminators naturally associated 
with the plant protox gene (i.e. "protox terminators"). These can be used in both 
monocotyledons and dicotyledons. 

111. Sequences for the Enhancement or Regulation of Expression 

Numerous sequences have been found to enhance gene expression from within the 
transcriptional unit and these sequences can be used in conjunction with the genes of this 
invention to increase their expression in transgenic plants. 
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Various intron sequences have been shown to enhance expression, particularly in 
monocotyledonous cells. For example, the introns of the maize Adh1 gene have been 
found to significantly enhance the expression of the wild-type gene under its cognate 
promoter when introduced into maize cells. Intron 1 was found to be particularly effective 
and enhanced expression in fusion constructs with the chloramphenicol acetyltransferase 
gene (Callis etaL, Genes Develop. 1: 1183-1200 (1987)). In the same experimental 
system, the intron from the maize bronzel gene had a similar effect in enhancing 
expression (Callis etaL, supra), Intron sequences have been routinely incorporated into 
plant transformation vectors, typically within the non-translated leader. 

A number of non-translated leader sequences derived from viruses are also known 
to enhance expression, and these are particularly effective in dicotyledonous cells. 
Specifically, leader sequences from Tobacco Mosaic Virus (TMV, the 'W-sequence"), Maize 
Chlorotic Mottle Virus (MCMV), and Alfalfa Mosaic Virus (AMV) have been shown to be 
effective in enhancing expression {e.g. Gallie etaL NucL Acids Res. 15: 8693-8711 (1987); 
Skuzeski et aL Plant Molec. Biol. 1 5: 65-79 (1 990)) 

IV. Targeting of the Gene Product Within the Cell 

Various mechanisms for targeting gene products are known to exist in plants and 
the sequences controlling the functioning of these mechanisms have been characterized in 
some detail. For example, the targeting of gene products to the chloroplast is controlled by 
a signal sequence that is found at the amino terminal end of various proteins and that is 
cleaved during chloroplast import yielding the mature protein {e.g. Comai etaL J. BioL 
Chem. 263: 15104-15109 (1988)). These signal sequences can be fused to heterologous 
gene products to effect the import of heterologous products Into the chloroplast (van den 
Broeck etaL Nature 313: 358-363 (1985)). DNA encoding for appropriate signal sequences 
can be isolated from the 5' end of the cDNAs encoding the RUBISCO protein, the CAB 
protein, the EPSP synthase enzyme, the GS2 protein and many other proteins that are 
known to be chloroplast localized. 

Other gene products are localized to other organelles such as the mitochondrion 
and the peroxisome (e.g. Unger et aL Plant Molec. BioL 13: 41 1-41 8 (1 989)). The cDNAs 
encoding these products can also be manipulated to effect the targeting of heterologous 
gene products to these organelles. Examples of such sequences are the nuclear-encoded 
ATPases and specific aspartate amino transferase isoforms for mitochondria. Targeting to 
cellular protein bodies has been described by Rogers et aL, Proc. NatL Acad. ScL USA 82: 
6512-6516(1985)). 
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In addition, sequences have been characterized that cause the targeting of gene 
products to other cell compartments. Amino terminal sequences are responsible for 
targeting to the ER, the apoplast, and extracellular secretion from aleurone cells (Koehler & 
Ho. Plant Cell 2: 769-783 (1990)). Additionally, amino terminal sequences in conjunction 
with carboxy tenninal sequences are responsible for vacuolar targeting of gene products 
(Shinshi etaL, Plant Molec. BloL 14: 357-368 (1990)). 

By the fusion of the appropriate targeting sequences described above to transgene 
sequences of interest it is possible to direct the transgene product to any organelle or cell 
compartment. For chloroplast targeting, for example, the chloroplast signal sequence from 
the RUBISCO gene, the CAB gene, the EPSP synthase gene, or the GS2 gene is fused in 
frame to the amino temninal ATG of the transgene. The signal sequence selected should 
include the known cleavage site and the fusion constmcted should take into account any 
amino acids after the cleavage site that are required for cleavage. In some cases this 
requirement may be fulfilled by the addition of a small number of amino acids between the 
cleavage site and the transgene ATG or alternatively replacement of some amino acids 
within the transgene sequence. Fusions constructed for chloroplast import can be tested 
^-for-eff icacy of chloroplast uptake by in wfro translation of /n wYro transcribed constructions 
followed by. /n.v/fro chloroplast uptake using techniques described by (Bartlett era/. In: 
>:Edelmann.ef a/. (Eds.) Methods in Chloroplast Molecular Biology, Elsevier, pp. 1081-1091 
:^(4982),'^Wasmann -era/.-/Wo/.--Gen. Genet 205: 446-453 (1986)). These construction 
' ^techniques are well known in the art and are equally applicable to mitochondria and 
^peroxisomes. The choice of targeting that may be required for expression of the 
.»>trarpsgenes wilNepend-on.the the precursor required as the starting 

,;:irp0intMfor-^aigiy usually be cytosolic or chloroplastic, although it may is 

some cases be mitochondrial or peroxisomal. The products of transgene expression will not 
nomnally require targeting to the ER, the apoplast or the vacuole. 

The above described mechanisms for cellular targeting can be utilized not only in 
conjunction with their cognate promoters, but also in conjunction with heterologous 
promoters so as to effect a specific cell targeting goal under the transcriptional regulation of 
a promoter that has an expression pattern different to that of the promoter from which the 
targeting signal derives. 

Example 23: Transformation of Dicotyledons 
Transformation techniques for dicotyledons are well known in the art and include 
Agrobacterium-hased techniques and techniques that do not require Agrobacterium. Non- 
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Agrobacterium techniques involve the uptake of exogenous genetic material directly by 
protoplasts or cells. This can be accomplished by PEG or electroporation mediated uptake, 
particle bombardment-mediated delivery, or microinjection. Examples of these techniques 
are described by Paszkowski et al, EMBO J 3: 271 7-2722 (1 984), Potrykus et ai, MoL 
Gen, Genet 199: 169-177 (1985), Reich ef a/., Biotechnology 4: 1001-1004 (1986), and 
Klein et aL, Nature 327: 70-73 (1987). In each case the transformed cells are regenerated 
to whole plants using standard techniques known in the art. 

vAgrobacfer/um-mediated transformation is a preferred technique for transformation 
of dicotyledons because of its high efficiency of transformation and its broad utility with 
many different species. The many crop species that are routinely transfomnable by 
Agrobacterium include tobacco, tomato, sunflower, cotton, oilseed rape, potato, soybean, 
alfalfa and poplar (EP 0 317 511 (cotton), EP 0 249 432 (tomato, to Calgene), WO 
87/07299 {Brassica, to Calgene), US 4,795,855 (poplar)). 

Transformation of the target plant species by recombinant Agrobacterium usually 
involves co-cultivation of the Agrobacterium with explants from the plant and follows 
protocols well known in the art. Transformed tissue is regenerated on selectable medium 
carrying the antibiotic or herbicide resistance marker present between the binary plasmid T- 
DNA borders. 

Example 24: Transformation of Monocotyledons 
Transformation of most monocotyledon species has now also become routine. 
Prefen-ed techniques include direct gene transfer into protoplasts using PEG or 
electroporation techniques, and particle bombardment into callus tissue. Transformations 
can be undertaken with a single DNA species or multiple DNA species (/.e. co- 
transfomiation) and both these techniques are suitable for use with this invention. Co- 
transformation may have the advantage of avoiding complex vector construction and of 
generating transgenic plants with unlinked loci for the gene of interest and the selectable 
marker, enabling the removal of the selectable marker in subsequent generations, should 
this be regarded desirable. However, a disadvantage of the use of co-transformation is the 
less than 100% frequency with which separate DNA species are integrated into the genome 
(Schocher et al. Biotechnology 4: 1 093-1 096 (1 986)). 

Patent Applications EP 0 292 435 (to Ciba-Geigy), EP 0 392 225 (to Ciba-Geigy) 
and WO 93/07278 (to Ciba-Geigy) describe techniques for the preparation of callus and 
protoplasts from an elite inbred line of maize, transformation of protoplasts using PEG or 
electroporation, and the regeneration of maize plants from transformed protoplasts. 
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Gordon-Kamm ef a/., Plant Ce// 2; 603-61 8 (1990)) and Fromm etaL, Biotechnology 8: 833- 
839 (1990)) have published techniques for transformation of A188-derived maize line using 
particle bombardment. Furthermore, application WO 93/07278 (to Ciba-Geigy) and Koziel 

a/.. Biotechnology 1 1: 194-200 (1993)) describe techniques for the transformation of elite 
inbred lines of maize by particle bombardment. This technique utilizes immature maize 
embryos of 1,5-2.5 mm length excised from a maize ear 14-15 days after pollination and a 
PDS-1000He Biolistics device for bombardment. 

Transformation of rice can also be undertaken by direct gene transfer techniques 
utilizing protoplasts or particle bombardment. Protoplast-mediated transformation has been 
described for Japonica-types and Indica-Xypes (Zhang et aL, Plant Cell Rep 7: 379-384 
(1988); Shimamoto ef a/. Nature 33a 274-277 (1989); Datta etaL Biotechnology 8:736-740 
(1990)). Both types are also routinely transfomnable using particle bombardment (Christou 
et ai Biotechnology 9: 957-962 (1 991 )). 

Patent Application EP 0 332 581 (to Ciba-Geigy) describes techniques for the 
generation, transformation and regeneration of Pooideae protoplasts. These techniques 
allow the transformation of Dactylis and wheat. Furthermore, wheat transformation was 
^been- described by Vasil et aL; Biotechnology 70: 667-674 (1992)) using particle 
'.-bombardmentMntp^cells of type C long-tenm regenerable callus, and also by Vasil et aL, 
-Biotechnology H; 1553-1558 (1993)) and Weeks etaL, Plant PhysioL 10Z 1077-1084 
-.A^(4993) using^particle bombardmentof immature embryos and immature embryo-derived 
" callus. A preferred technique for. wheat transformation, however-involves the 
s*transformation of wheat by particle bombardment of-immature embryos and includes either 
^s^a^lgh^sucrose or a^ high maltose istep pri gene delivery; Prior to bombardment, any 
^^number of.embryos (0.75-1 mm in length)rare plated onto MS medium with 3% sucrose 
^Murashige & Skoog, Physiologia Plantamm 15: 473-497 (1962)) and 3 mg/l 2,4-D for 
induction of somatic embryos, which is allowed to proceed in the dark. On the chosen day 
of bombardment, embryos are removed from the induction medium and placed onto the 
osmoticum (/.e. induction medium with sucrose or maltose added at the desired 
concentration, typically 15%), The embryos are allowed to plasmolyze for 2-3 h and are 
then bombarded. Twenty embryos per target plate is typical, although not critical. An 
appropriate gene-carrying plasmid (such as pCIB3064 or pSG35) is precipitated onto 
micrometer size gold particles using standard procedures. Each plate of embryos is shot 
with the DuPont Biolistics* helium device using a burst pressure of -1000 psi using a 
standard 80 mesh screen. After bombardment, the embryos are placed back into the dark 
to recover for about 24 h (still on osmoticum). After 24 hrs, the embryos are removed from 



90 



wo 01/68826 PCT/EPOl/02753 

the osmoticum and placed back onto induction medium where they stay for about a month 
before regeneration. Approximately one month later the embryo explants with developing 
embryogenic callus are transfen-ed to regeneration medium (MS + 1 mg/liter NAA, 5 mg/liter 
GA), further containing the appropriate selection agent (10 mg/l basta in the case of 
pClB3064 and 2 mg/l methotrexate in the case of pSOG35). After approximately one 
month, developed shoots are transferred to larger sterile containers known as "GA7s" that 
contained half-strength MS, 2% sucrose, and the same concentration of selection agent. 
Patent application WO 94/13822 describes methods for wheat transformation and is hereby 
incorporated by reference. 

Example 25: Isolation of the Arabidopsis thaliana Protox-1 Promoter Sequence 
A Lambda Zap II genomic DNA library prepared from Arabidopsis thaliana 
(Columbia, whole plant) was purchased from Stratagene. Approximately 125,000 phage 
were plated at a density of 25,000 pfu per 15 cm Petri dish and duplicate lifts were made 
onto Colony/Plaque Screen membranes (NEN Dupont). The plaque lifts were probed with 
the Arabidopsis protox-1 cDNA (SEQ ID N0:1 labeled with 32P-dCTP by the random 
priming method (Life Technologies). Hybridization and wash conditions were at 65^*0 as 
described in Church and Gilbert, Proa NatL Acad. Sci, USA 81: 1991-1995 (1984). 
Positively hybridizing plaques were purified and in vivo excised into pBluescript plasmids. 
Sequence from the genomic DNA inserts was determined by the chain termination method 
using dideoxy temainators labeled with fluorescent dyes (Applied Biosystems, Inc.). One 
clone, AraPTI Pro, was determined to contain 580-bp of Arabidopsis sequence upstream 
from the initiating methionine (ATG) of the protox-1 protein coding sequence. This clone 
also contains coding sequence and introns that extend to-bp 1241 of the protox-1 cDNA 
sequence. The 580-bp 5' noncoding fragment is the putative Arabidopsis protox-1 
promoter, and the sequence is set forth in SEQ ID N0:13. 

AraPTI Pro was deposited December 15, 1995, as pWDC-11 (NRRL #B-21515> 

Example 26: Construction of Plant Transformation Vectors Expressing Altered Protox-1 

Genes Behind the Native Arabidopsis Protox-1 Promoter 
A full-length cDNA of the appropriate altered Arabidopsis protox-1 cDNA was 
isolated as an Ecofl/-X/?o/ partial digest fragment and cloned into the plant expression 
vector pCGN1 761 ENX (see Example 9 of International application no. PCT/IB95/00452 
filed June 8, 1995, published Dec. 21 , 1995 as WO 95/34659). This plasmid was digested 
with Ncol and BamHI to produce a fragment comprised of the complete protox-1 cDNA plus 
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a transcription terminator from the 3' untranslated sequence of the frn/gene of 
Agrobacterium tumefaciens. The AraPTI Pro plasmid described above was digested with 
Ncol and BamHIXo produce a fragment comprised of pBluescript and the 580-bp putative 
Arabidopsis protox-1 promoter. Ligation of these two fragments produced a fusion of the 
altered protox cDNA to the native protox promoter. The expression cassette containing the 
protox-1 promoter/protox-1 cDNA/fm/ terminator fusion was excised by digestion with Kpnl 
and cloned into the binary vector pCIB200. The binary plasmid was transformed by 
electroporation into Agrobacterium and then into Arabidopsis using the vacuum infiltration 
method (Bechtold ef a/., C.R Acad. Sci. Paris 3^6: 1194-1199 (1993). Transformants 
expressing altered protox genes were selected on kanamycin or on various concentrations 
of protox inhibiting herbicide. 

Example 27: Production of Herbicide Tolerant Plants by Expression of a Native Protox-1 

Promoter/Altered Protox-1 Fusion 
Using the procedure described above,*an Arabidopsis proXox-l cDNA containing a 
TAG to ATG (Tyrosine to Methionine) change at nucleotides 1306-1308 in the protox-1 
^sequence (SEQ ID NO;1) was fused to the native protox-1 promoter fragment and 
transformed into vArafc/cfops/s f/7a//ana This altered protox-1 enzyme (Arae-2Met) has been 
-shown to^be >'10-f old-more tolerant to various protox-inhibiting heriDicides than the naturally 
- OGCumng^enzyme-when^tested in the previously described bacterial expression system. 
Seed from the vacuum infiltrated plants was collected and plated on a range (lO.OnM- 
1 iOuM) of aprotox inhibitory aryiuracil hertDicide of fomnula XVII. Multiple experiments with 
>wildi^type'>4ra6/c/ops/s have shown^that a 1 0.OnM concentration of this compound is 
.sufficient torprevent normal seedling germination. Transgenic seeds expressing the AraC- 
2Met altered enzyme fused to the native protox-1 promoter produced normal Arabidopsis 
seedlings at herbicide concentrations up to 500nM. indicating at least 50-fold higher 
herbicide tolerance when compared to wild-type Arabidopsis. This promoter/altered protox 
enzyme fusion therefore functions as an effective selectable mari<er for plant 
transfomnation. Several of the plants that gemninated on lOO.OnM of protox-inhibiting 
hertDicide were transplanted to soil, grown 2-3 weeks, and tested in a spray assay with 
various concentrations of the protox-inhibiting herbicide. When compared to empty vector 
control transformants, the AraPTI Pro/AraC-2Met transgenics were >10-fold more tolerant to 
the heriDicide spray. 
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EXAMPLE 28: Demonstration of resistant mutations* cross-tolerance to various protox- 

inhibiting compounds in an Arabidopsis germination assay. 
Using the procedure described above, an Arabidopsis protox-1 cDNA containing 
both a TAG to ATC (tyrosine to isoleucine) change at nucleotides 1306-1308 and a TCA to 
TTA (serine to leucine) change at nucleotides 945-947 in the protox-1 sequence (SEQ ID 
N0:1) was fused to the native protox-1 promoter fragment and transformed into Arabidopsis 
thaliana. This altered protox-1 enzyme (AraC-2lle + AraC305Leu) has been shown to be 
>1 0-fold more tolerant to a protox inhibitory aryluracil herbicide of formula XVII than the 
naturally occurring enzyme when tested in a bacterial system (see Examples 9-13). 
Homozygous Arabidopsis lines containing this fusion were generated from transf omnants 
that showed high tolerance to a protox inhibiting herbicide in a seedling germination assay 
as described above. The seed from one line was tested for cross-tolerance to various 
protox-inhibitory compounds by repeating the germination assay on concentrations of the 
compounds that had been shown to inhibit germination of wild-type Arabidopsis. The 
results from these experiments are shown In Table 4. 



Example 29: Isolation of a Maize Protox-1 Promoter Sequence 
A Zea Mays (Missouri 17 inbred, etiolated seedlings) genomic DNA library in the 
Lambda FIX II vector was purchased from Stratagene. Approximately 250,000 pfu of the 
library was plated at a density of 50,000 phage per 15 cm plate and duplicate lifts were 
made onto Colony/Plaque screen membranes (NEN Dupont). The plaque lifts were probed 
with the maize protox-1 cDNA (SEQ ID NO:5) labeled with 32P-dCTP by the random 
priming method (Life Technologies). Hybridization and wash conditions were at 65^*0 as 
described in Church and Gilbert, Proa NatL Acad, Sci, USA 81: 1991-1995 (1984). 
Lambda phage DNA was isolated from three positively hybridizing phage using the Wizard 
Lambda Preps DNA Purification System (Promega), Analysis by restriction digest, 
hybridization pattems, and DNA sequence analysis identified a lambda clone containing 
approximately 3.5 kb of maize genomic DNA located 5' to the maize protox-1 coding 
sequence previously isolated as a cDNA clone. This fragment includes the maize protox-1 
promoter. The sequence of this fragment is set forth in SEQ ID NO:14. From nucleotide 1 
to 3532, this sequence is comprised of 5' noncoding sequence. From nucleotide 3533 to 
3848, this sequence encodes the 5' end of the maize protox-1 protein. 

A plasmid containing the sequence of SEQ ID NO:14 fused to the remainder of the 
maize protox-1 coding sequence was deposited March 19, 1996 as pWDG-14 (NRRL #B- 
21546). 
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Example 30: Construction of Plant Transformation Vectors Expressing Altered Protox-1 

Genes Behind the Native Maize Protox-1 Promoter 
The 3848-bp maize genomic fragment (SEQ ID NO:14) was excised from the 
isolated lambda phage clone as a Sa//-Kpnl partial digest product and ligated to a KpnI-NotI 
fragment derived from an altered maize protox-1 cDNA that contained an alanine to leucine 
change at amino acid 164 (SEQ ID NO:6). This created a fusion of the native maize protox- 
1 promoter to a full length cDNA that had been shown to confer herbicide tolerance in a 
bacterial system (Examples 9-14). This fusion was cloned into a pUC18 derived vector 
containing the CaMV 35S terminator sequence to create a protox promoter/altered protox 
cDNA/terminator cassette. The plasmid containing this cassette was designated pWCo-1 . 

A second construct for maize transformation was created by engineering the first 
intron found in the coding sequence from the maize genomic clone back into the maize 
cDNA. The insertion was made using standard overlapping PGR fusion techniques. The 
Kintron (SEQ ID NO:25) was 93-bp long and was inserted between nucleotides 203 and 204 
of-rSEQ ID NO:6, exactly as it appeared in natural context in the lambda clone described in 
^Example 29. This intron-containing version of the expression cassette was designated 
-pWGo-2. 

'^Example Sl v '^Demonstration of.Maize Protox-1 Promoter Activity in Transgenic Maize 

Plants 

' Maize plants transformed with maize protox prornoter/altered protox fusions were 
l^ide^WiiBd using^PGR^analysis-witb primers s for the transgene. Total RNA was 
.f^-pcepared from the RCR-^positive plants and reverse-transcribed using Superscript M-MLV 
, ;^:(Life Technologies) under recommended conditions. Two microliters of the reverse 
transcription reaction was used in a PGR reaction designed to be specific for the altered 
protox sequence. While untransformed controls give no product in this reaction, 
approximately 85% of plants transfomned with pWGo-1 gave a positive result, indicating the 
presence of mRNA derived from the transgene. This demonstrates some level of activity for 
the maize protox promoter. The RNA's from the transgenic maize plants were also 
subjected to standard northem blot analysis using the radiolabeled maize protox cDNA 
fragment from SEQ ID N0:6 as a probe. Protox-1 mRNA levels significantly above those of 
untransformed controls were detected in some of the transgenic maize plants. This 
elevated mRNA level is presumed to be due to expression of altered protox-1 mRNA from 
the cloned maize protox promoter. 
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Example 32: Isolation of a Sugar Beet Protox-1 Promoter Sequence 
A genomic sugar beet library was prepared by Stratagene in the Lambda Fix II 
vector. Approximately 300,000 pfu of the library was plated and probed with the sugar beet 
protox-1 cDNA sequence (SEQ ID N0:17) as described for maize in Example 29. Analysis 
by restriction digest, hybridization patterns and DNA sequence analysis identified a lambda 
clone containing approximately 7 kb of sugar beet genomic DNA located 5' to the sugar 
beet coding sequence previously isolated as a cDNA clone. A Pstl-Sall fragment of 2606- 
bp was subcloned from the lambda clone into a pBluescript vector. This fragment contains 
2068-bp of 5' noncoding sequence and includes the sugar beet protox-1 promoter 
sequence. It also includes the first 453-bp of the protox-1 coding sequence and the 85-bp 
first intron contained in the coding sequence. The sequence of this fragment is set forth in 
SEQ ID NO:26. 

A plasmid containing the sequence of SEQ ID NO:26 was deposited December 6, 
1996 as pWDC-20 (NRRL #8-21650). 

Example 33: Construction of Plant Transformation Vectors Expressing Altered Sugar Beet 

Protox-1 Genes Behind the Native Sugar Beet Protox-1 Promoter 
The sugar beet genomic fragment (SEQ ID NO:26) was excised from the genomic 
subclone described in Example 32 as a SacZ-SsrO/ fragment that includes 2068-bp of 5' 
noncoding sequence and the first 300-bp of the sugar beet protox-1 coding sequence. This 
fragment was ligated to a BsrGI-Notl fragment derived from an altered sugar beet protox-1 
cDNA that contained a tyrosine to methionine change at amino acid 449 (SEQ ID NO: 18). 
This created a fusion of the native sugar beet protox-1 promoter to a full length cDNA that 
had been shown to confer herbicide tolerance in a bacterial system (Examples 9-14). This 
fusion was cloned into a pUC18 derived vector containing the CaMV 35S temninator 
sequence to create a protox promoter/altered protox cDNA/terminator cassette. The 
plasmid containing this cassette was designated pWCo-3. 

Example 34: Production of Herbicide Tolerant Plants by Expression of a Native Sugar Beet 

Protox-1 Promoter/Altered Sugar Beet Protox-1 Fusion 
The expression cassette from pWCo-3 is transformed into sugar beet using any of 
the transformation methods applicable to dicot plants, Including Agrobacterium, protoplast, 
and biolistic transformation techniques. Transgenic sugar beets expressing the altered 
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protox-1 enzyme are identified by RNA-PCR and tested for tolerance to protox-inhibiting 
herbicides at concentrations that are lethal to untransformed sugar beets. 



Section D: Expression of Protox Genes in Plant Plastids 

Example 35: Preparation of a Chimeric Gene Containing the Tobacco Plastid cIpP Gene 
Promoter and Native clpP5' Untranslated Sequence Fused to a GUS Reporter Gene and 
Plastid rps16 Gene 3' Untranslated Sequence in a Plastid Transformation Vector 

L Amplification of the Tobacco Plastid cIpP Gene Promoter and Complete 5' Untranslated 
RNA (5' UTR). 

Total DNA from N, tabacum c.v. "Xanthi NC" was used as the template for PCR with 
a left-to-right "top strand" primer comprising an introduced EcoRI restriction site at position - 
197 relative to the ATG start codon of the constitutively expressed plastid cIpP gene (primer 
Pclp_P1a: 5'-GCGGAAIICATACTTATTTATCATTAGAAAG-3' (SEQ ID NO:27); EcoRI 
restriction site underlined) and a right-to-left "bottom strand" primer homologous to the 
region from -21-to -Irrelative to the AT& start codon of the c/pP promoter that incorporates 
an introduced WcoArestriction site at the start of translation (primer Pclp_P2b: 5'- 
' :GGGCeAiGGTAAATGAAAGAAAGAAeTAAA-3' (SEQ ID NO:28); Ncol restriction site 
^-underlined).f''ThiS'PeR''reaction was undertaken with P/a thermostable DNA polymerase 

(Stratagene, La Jolla CA) in a Perkin Elmer Themial Cycler 480 according to the 
' manufacturer's recommendations (Perkin Elmer/Roche, Branchburg, NJ) as follows: 7 min 
95°Crfollowed% 4^GyGles of 1 min^^M^ min 72*^0. then 25 cycles of 1 min 

95°G / 2,min 55°C / 1; min 72°C. The 21 3-bp amplification product comprising the promoter 
and 5' untranslated region of the c/pP gene containing an EcoRI s\ie at its left end and an 
A/cfo/ site at its right end and corresponding to nucleotides 74700 to 74505 of the N. 
fabacum plastid DNA sequence (Shinozaki etaL, EMBOJ, 5: 2043-2049 (1986)) was gel 
purified using standard procedures and digested with EcoRI and Ncol (all restriction 
enzymes were purchased from New England Biolabs, Beverly, MA). 

II. Amplification of the Tobacco Plastid rps16 Gene 3' Untranslated RNA Sequence 
(3'UTR). 

Total DNA from A/, tabacum c.v. "Xanthi NC" was used as the template for PCR as 
described above with a left-to-right "top strand" primer comprising an introduced Xbal 
restriction site immediately following the TAA stop codon of the plastid rps16 gene encoding 
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ribosomal protein S16 (primer rps16P_1a (5'- 

GCGIQIAGATCAACCGAAATTCAATTAAGG-y (SEQ ID NO:30); Xbal restriction site 
underlined) and a right-to-left "bottom strand" primer homologous to the region from +134 to 
+151 relative to the TAA stop codon of rps1 6 \ha\ incorporates an introduced Hindlll 
restriction site at the 3' end of the rps163' UTR (primer rps16P_1b (5'- 
CGC AAGCTT CAATGGAAGCAATGATAA-3' (SEQ ID NO:31); Hindlll restriction site 
underlined). The 169-bp amplification product comprising the 3' untranslated region of the 
rps16 gene containing an Xbal site at its left end and a Hindlll site at its right end and 
containing the region con"esponding to nucleotides 4943 to 5093 of the N. tabacum plastid 
DNA sequence (Shinozaki et aL, 1986) was gel purified and digested with Xbal and Hindlli 

III. Ligation of a GUS Reporter Gene Fragment to the clpP Gene Promoter and 5' and 3' 
UTR's. 

An 1864-bp p-glucuronidase (GUS) reporter gene fragment derived from plasmid 
pRAJ275 (Clontech) containing an Ncol restriction site at the ATG start codon and an Xbal 
site following the native 3' UTR was produced by digestion with Ncol and Xbal This 
fragment was ligated in a four-way reaction to the 201 -bp EcoRI/Ncol cIpP promoter 
fragment, the 157-bp Xbal/ Hindlll rps1 6 3VJR fragment, and a 3148-bp EcoRII Hindlll 
fragment from cloning vector pGEM3Zf(-) (Promega, Madison Wl) to construct plasmid 
pPH138. Plastid transfonmation vector pPH140 was constructed by digesting plasmid 
pPRVnia (Zoubenko etaL 1994) with EcoRI and Hindlll and ligating the resulting 7287-bp 
fragment to a 2222-bp EcoRI/Hindlll UagmenX of pPH138, 

Example 36: Preparation of a Chimeric Gene Containing the Tobacco Plastid cIpP Gene 
Promoter Plus Tobacco Plastid psbA Gene Minimal 5' Untranslated Sequence Fused to a 
GUS Reporter Gene and Plastid rp$16 Gene 3' Untranslated Sequence in a Plastid 

Transfomnation Vector 
Amplification of the tobacco plastid cIpP gene promoter and truncated 5* 
untranslated RNA (5* UTR): Total DNA from N. tabacum c.v. "Xanthi NC" was used as the 
template for PGR as described above with the left-to-right "top strand" primer Pclp^Pla 
(SEQ ID NO:27) and a right-to-left "bottom strand" primer homologous to the region from - 
34 to -1 1 relative to the ATG start codon of the cIpP promoter that incorporates an 
introduced Xbal restriction site in the cIpPS' UTR at position -1 1 (primer Pclp_P1b: 5'- 
GCG TCTAGAA AGAACTAAATACTATATTTCAC-3' (SEQ ID NO:29); Xbal restriction site 
underlined). The 202-bp amplification product comprising the promoter and truncated 5' 
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UTR of the c/pP gene containing an EcoRI sWe at its left end and an Xbal site at its right 
end was gel purified and digested with XbaL The Xbal sWe was subsequently filled in with 
Klenow DNA polymerase (New England Biolabs) and the fragment digested with EcoRL 
This was ligated in a five-way reaction to a double stranded DNA fragment corresponding to 
the final 38 nucleotides and ATG start codon of the tobacco plastid psbA gene 5' UTR {with 
an Ncol restriction site overhang introduced into the ATG start codon) that was created by 
annealing the synthetic oligonucleotides minpsb_LI (top strand: 5'- 

GGGAGTCCCTGATGATTAAATAAACCAAGATTTTAC-3' (SEQ ID NO:32)) and minpsb_L 
(bottom strand: 5'-CAIGGTAAAATCTTGGTTTA7TTAATCATCAGGGACTCCC-3' (SEQ ID 
NO:33); A/co/ restriction site 5' overhang underlined), the Ncol/XbalGUS reporter gene 
fragment described above, the Xbal/Hindlll rps16 3VTR fragment described above, and the 
EcoRI/Hindlll pGEM3Zf (-) fragment described above to constnjct plasmid pPHI 39. Plastid 
transformation vector pPH1 44 was constructed by digesting plasmid pPRV1 11a (Zoubenko, 
ef a/., Nucleic Acids Res 22: 381 9-3824 (1 994)) with EcoRI and Hindlll and ligating the 
resulting 7287-bp fragment to a 2251 -bp EcoRII Hindlll fragment of pPHI 39. 

-I'Example 37: Preparation^of^^ Plastid cIpP Gene 

-Promoter and Complete 5' Untranslated Sequence Fused to the Arabidopsis thaliana 
-^^tpfotox^t'-Goding Sequence-^and^Plastid^/pstSGene 3^ In a Vector 

Mdf Tobacccf Plastid Transformation 
^ Miniprep DNA f rom plasmid AraGr2Met carrying an Arabidopsis thaliana NotI insert 
^>>thBtMhcIudesTe©NA^sequ^e^^^ the^Rrotoporphyrinogen IX Oxidase ("protox") gene 

^"^ieriG(3ding a pdrtion^df theiaminb>:terminalf>plastid transitcpeptide, the^^ cDNA and a 

■portion of the.3'. untranslated, region was.used as the template for PGR as described above 
using a left-to-right top strand" primer (with homology to nucleotides +172 to +194 relative 
to the ATG start codon of the full length precursor protein) comprising an introduced Ncol 
restriction site and new ATG start codon at the deduced start of the mature protox protein 
coding sequence (primer APRTXP1 a: 5- 

GGGACCAIGGATTGTGTGATTGTCGGCGGAGG-3' (SEQ ID NO:34); Ncol restriction site 
underiined) and a right-to-left "bottom strand" primer homologous to nucleotides +917 to 
+940 relative to the native ATG start codon of the protox precursor protein (primer 
APRTXPIb: 5'- CTCCGCTCTCGAGGTTAGTGATAG-3' (SEQ ID NO:35)), The 778-bp 
product was digested with Ncol and Sft//and the resulting 682-bp fragment ligated to an 
844-bp SMI NotI DNA fragment of AraC-2Met comprising the 3' portion of the protox coding 
sequence and a 2978-bp Afco/^A/of/ fragment of the cloning vector pGEM5Zf(+) (Promega, 
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Madison Wl) to construct plasmid pPH1 41 . Plastid transformation vector pPH1 43 
containing the cIpP pronr^oter driving the Formula XVII-resistant AraC-2Met protox gene with 
the rps163* UTR was constructed by digesting pPH141 with Ncolan6 Ssp/ and isolating 
the 1491 -bp fragment containing the complete protox coding sequence, digesting the 
rps16P„1a and rps16P_.1b PGR product described above with Hindlll, and Itgating these to 
a 7436-bp /Vco/ZH/nd/// fragment of pPH140. 

Example 38: Preparation of a Chimeric Gene Containing the Tobacco Plastid cIpP Gene 
Promoter Plus Tobacco Plastid psbA Gene Minimal 5' Untranslated Sequence Fused to the 

Arabidopsis thai/ana Protox-1 Coding Sequence and Plastid rps16 Gene 3' Untranslated 

Sequence in a Vector for Tobacco Plastid Transformation 
Plastid transfomnation vector pPH145 containing the c/pPp remote r/psM 5' UTR 
fusion driving the Formula XVII-resistant AraC-2Met protox gene with the rpsW 3' UTR was 
constructed by digesting pPH141 with Ncol and Ssp/ and isolating the 1491 -bp fragment 
containing the complete protox coding sequence, digesting the rps16P_1a and rps16P_1b 
PCR product described above with Hindlll, and ligatihg these to a 7465-bp Ncol/Hindlll 
fragment of pPH144. 

Example 39: Preparation of a Chimeric Gene Containing the Tobacco Plastid cIpP Gene 
Promoter and 5' Untranslated Sequence Fused to the EPSP Synthase Coding Sequence 
and Plastid rps16 Gene 3' Untranslated Sequence in a Vector for Tobacco Plastid 

Transformation 

A cDNA library is screened for the 5-enolpyruvyl-3-phosphoshikimate synthase 
(EPSP synthase) gene (U.S. Patent Nos. 5,310,667, 5,312,910. and 5,633,435, all 
incorporated herein by reference). A plasmid clone containing the full length EPSP 
synthase gene cDNA is isolated by standard techniques of molecular cloning. PCR primers 
are designed for amplification of the mature-size EPSP synthase coding sequence from this 
plasmid using a top strand primer having a 5' extension containing an Ncol restriction site 
inserted at amino acid -1 from the deduced mature protein start, thus creating an ATG start 
codon at this position, and a bottom strand primer having a 5' extension containing an Xbal 
restriction site downstream of the stop codon of the EPSP mature coding sequence in the 
amplified PCR product. The PCR amplification is performed using the designated primers 
and plasmid DNA template according to standard protocols. Amplified products are cloned 
and sequenced and a Ncol-Xbal DNA fragment containing the complete mature EPSP 
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synthase coding sequence is isolated by restriction digest with Ncol and Xbal, 
electrophoresis on a 0.8% TAE agarose gel, and phenol extraction of the excised band. 

A plastid transformation vector containing the cIpP promoter directing transcription of 
the mature-sized EPSP synthase gene with the rps163' UTR is constructed by digesting 
pPH140 with Ncol and Xjba/and purifying the fragment containing the vector backbone, 5' 
and 3' plastid integration targeting sequences, aacM selectable marker cassette, and cIpP 
promoter / rps16 3' UTR expression sequences. This product is ligated in a two-way 
reaction with the Ncol-Xbal DNA fragment containing the mature-sized EPSP synthase 
coding sequence isolated as described above. 

Example 40: Preparation of a Chimeric Gene Containing the Tobacco Plastid cIpP Gene 
Promoter and 5' Untranslated Sequence Fused to the ALS Coding Sequence and Plastid 
tpsie Gene 3' Untranslated Sequence in a Vector for Tobacco Plastid Transfonmation 
A cDNA library is screened for the acetolactate synthase (ALS) gene (U.S. Patent 
No^ 5,013,659). A plasrnid clone containing the fall length ALS gene cDNA is isolated by 
standard techniques of molecular cloning. PGR primers are designed for amplification of the 
'^^^mature^size ALS'^coding' sequence from this plasmid us;inga top strand primer having a 5' 
-^extension GontainingiantA/coArestriction site inserted at amino acid -1 from the deduced 
-mature protein :start;.thus creating an ATG start codqn at this position, and a bottom strand 
./fi'primer^having a^5Jj?extension^Gontaining an Xda/ restriction site downstream of the stop 
Ti?codon^of4he^ALS mature:GOding sequence in the amplified PCR product. The PCR 
^^amplifieation-is-perfonned^.using' the designated primers and plasmid DNA template 
viaeeordifijg^toistandard^protocols^Amplified cloned^ and sequenced and a Ncol- 

.#(da/#NA^fragment containing ^the complete mature ALS coding.sequence is isolated by 
.i^restriction digest with Ncol and Xbal, electrophoresis on a 0.87o TAE agarose get, and 
phenol extraction of the excised band. 

A plastid transformation vector containing the c/pP promoter driving the mature-sized 
ALS gene with the rps16 3' UTR is constructed by digesting pPH1 40 with Ncol and Xbal 
and purifying the fragment containing the vector backbone, 5' and 3' plastid integration 
targeting sequences, aadA selectable maricer cassette, and c/pP promoter / /ps75 3' UTR 
expression sequences. This product is ligated in a two-way reaction with the Ncol-Xbal 
DNA fragment containing the mature-sized ALS coding sequence isolated as described 
above. 
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Example 41 : Preparation of a Chimeric Gene Containing the Tobacco Plastid cIpP Gene 
Promoter and 5' Untranslated Sequence Fused to the AHAS Coding Sequence and Plastid 
rps16 Gene 3' Untranslated Sequence in a Vector for Tobacco Plastid Transformation 

A cDNA library is screened for the acetohydroxyacid synthase (AHAS) gene (U.S. 
Patent No. 4.761 ,373). A plasmid clone containing the full length AHAS gene cDNA is 
isolated by standard techniques of molecular cloning. PCR primers are designed for 
amplification of the mature-size AHAS coding sequence from this plasmid using a top 
strand primer having a 5' extension containing an Ncol restriction site inserted at amino acid 
-1 from the deduced mature protein start, thus creating an ATG start codon at this position, 
and a bottom strand primer having a 5' extension containing an Xbal restriction site 
downstream of the stop codon of the AHAS mature coding sequence in the amplified PCR 
product. The PCR amplification is perfomned using the designated primers and plasmid 
DNA template according to standard protocols. Amplified products are cloned and 
sequenced and a Ncol-Xbal DNA fragment containing the complete mature AHAS coding 
sequence is isolated by restriction digest with Wco/ and Xbal, electrophoresis on a 0.8% 
TAE agarose gel, and phenol extraction of the excised band. 

A plastid transfomnation vector containing the c/pP promoter driving the mature-sized 
AHAS gene with the rps16 3' UTR is constructed by digesting pPH140 with Ncol and Xbal 
and purifying the fragment containing the vector backbone, 5' and 3' plastid integration 
targeting sequences. aadA selectable marker cassette, and c/pP promoter / rps163' UTR 
expression sequences. This product is ligated in a two-way reaction with the Ncol-Xbal 
DNA fragment containing the mature-sized AHAS coding sequence isolated as described 
above. 

Example 42: Preparation of a Chimeric Gene Containing the Tobacco Plastid c/pP Gene 
Promoter and 5' Untranslated Sequence Fused to the ACCase Coding Sequence and 
Plastid rps16 Gene 3' Untranslated Sequence in a Vector for Tobacco Plastid 
Transfonnation 

A cDNA library is screened for the acetylcoenzyme A carboxylase (ACCase) gene 
(U.S. Patent No. 5.162,602). A plasmid clone containing the full length ACCase gene 
cDNA is isolated by standard techniques of molecular cloning. PCR primers are designed 
for amplification of the mature-size ACCase coding sequence from this plasmid using a top 
strand primer having a 5' extension containing an Ncol restriction site inserted at amino acid 
-1 from the deduced mature protein start, thus creating an ATG start codon at this position, 
and a bottom strand primer having a 5' extension containing an Xbal restriction site 
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downstream of the stop codon of the ACCase mature coding sequence in the amplified 
PGR product. The PGR amplification is performed using the designated primers and 
plasmid DNA template according to standard protocols. Amplified products are cloned and 
sequenced and a Ncol-Xbal DNA fragment containing the complete mature AGCase 
coding sequence is isolated by restriction digest with Ncol and Xbal, electrophoresis on a 
0.8% TAE agarose gel, and phenol extraction of the excised band. 

A plastid transfoHDation vector containing the cIpP promoter driving the mature-sized 
AGCase gene with the rps16 3' UTR is constructed by digesting pPH140 with Ncol and 
Xbal and purifying the fragment containing the vector backbone, 5' and 3' plastid integration 
targeting sequences, aadA selectable marker cassette, and c/pP promoter / /ps^fi 3' UTR 
expression sequences. This product is ligated in a two-way reaction with the Ncol-Xbal 
DNA fragment containing the mature-sized AGGase coding sequence isolated as described 
above. 

Example 43: Biolistic Transformation of the Tobacco Plastid Genome 
Seeds of Nicotiana tabacum c.v. 'Xanthi nc' were genninated seven per plate in a 1" 
^circular: array^on T agar mediunr and bombarded 12^14 days after sowing with 1 fjm . 
-tungsten particles (M10, Biorad, Hercules,-GA) coated with DNA from plasmids pPH143 and 
M-pPH1 45-essentialiy*as-described in Svab. Z. and Maliga, P. (1 993) PNAS 90, 91 3-91 7. 
^Bonnbarded^seedlings were incubated on T medium for two days after which leaves were 
'•fexeisedvand placed abaxial side up in bright light (350-500 pmol photons/m^/s) on plates of 
^;tRMOPi:medium (Svab, Z, HajdukiewiczVPVahd Maliga, 'P.' (1990) P/VAS 87, 8526-8530) 
-AGohtaining 500 pg/ml'spectinomycin dihydrochloride"(Sigma, St. Louis, MO). Resistant 
jstJiopts^appearing undemeath the bleached* leaves three to eight weeks after bombardment 
- were subcloned onto the same selective medium, allowed to fonm callus, and secondary 
shoots isolated and subcloned. Complete segregation of transformed plastid genome 
copies (homoplasmicity) in independent subclones was assessed by standard techniques of 
Southern blotting (Sambrook etaL, (1989) Molecular Cloning: A Laboratory Manual, Gold 
Spring Harbor Laboratory, Gold Spring Harbor). BamHI/EcoRI-6\gesXe6 total cellular DNA 
(Mettler, 1. J. (1 987) Plant Mol Biol Reporters, 346-349) was separated on 1 % Tris-borate 
(TBE) agarose gels, transferred to nylon membranes (Amersham) and probed with ^P- 
labeled random primed DNA sequences corresponding to a 0.7 kb BamHI/Hindlll DNA 
fragment from pG8 containing a portion of the rps7/12 plastid targeting sequence. 
Homopiasmic shoots are rooted aseptically on spectihomycin-containing MS/IBA medium 
(McBride, K. E. etai (1994) PAMS91, 7301-7305) and transferred to the greenhouse. 
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Example 44: Assessment of Herbicide Tolerance in Nt__pPH143 and NLpPH145 Plastid 

Transformant Lines 

Primary homoplasmic transformant lines transformed with pPH143 (line Nt„pPH143) 
or with pPH145 (line Nt_pPH145), which were obtained as described in Example 43, were 
grown to maturity in the greenhouse. Flowers were either: (a) self-pollinated, (b) pollinated 
with wildtype tobacco (c.v. Xanthi nc), or (c) used as pollen donors to fertilize emasculated 
flowers of wildtype Xanthi plants. Plastid segregation of the linked spectinomycin 
resistance marker was verified by uniparental female inheritance of the spectinomycin- 
resistance phenotype in each transformant line using a minimum of 50 seeds per selection 
pool derived from either selfed or backcross capsules. Additional self or wildtype backcross 
(Xanthi pollen parent) seeds were gemiinated in soil. 36 plants of each line (143 1B-1, 143 
1B-4, 143 4A-2. 143 4A-5, 145 7A-5, 145 7A-6, 145 8A-3) plus 36 wildtype Xanthi plants as 
isogenic controls were grown in separate 6" clay pots in a controlled environment cubicle, 
in order to assess tolerance to the protox inhibitor Fonmula XVII. plants of Xanthi and the 
seven transformant lines were distributed into eight identical 1 6-pot flats (2 plants of each 
type per flat). The flats were sprayed with Formula XVII until runoff at concentrations of 
either 0, 0.5, 2.5, 5, 10, 25, 50, or 100 mg Formula XVII per liter. Solutions were made up 
in water using 4 g/liter or 40 g/liter stock solutions of Formula XVII dissolved in 
dimethylsulfoxide (DMSO) and used immediately after preparation. Twenty microliters of 
the wetting agent Silwett was added to each 200 ml volume of herbicide solution for a final 
concentration of 0.01 %. Flats were sprayed in the late afternoon and allowed to dry 
overnight before transfer to the growth cubicle. Tolerance was assesed by comparing leaf 
damage and wilting to the untransformed Xanthi controls at 0, 18 hrs, 48 hrs, and 6 days 
post-application. Severe damage was apparent on the Xanthi plants at all concentrations 
above 0.5 mg/l, and complete wilting/bum down occurred above 2.5 mg/l. Only slight 
damage occurred on the Nt„pPH143 plants even at the highest concentration (100 
mg/liter), and the plants soon outgrew the bleached spots (the appearance of Xanthi at 0.5 
mg/iiter was approximately equivalent to Nt_pPH1 43 1 B-1 at 1 00 mg/liter, giving a tolerance 
of ca. 200-fold). 

Example 45: Plastid Transformation of Maize 
Type I embryogenic callus cultures (Green etaL (1983) in A. Fazelahmad, K. 
Downey, J. Schultz, R.W. Voellmy, eds. Advances in Gene Technology: Molecular 
Genetics of Plants and Animals. Miami Winter Symposium Series, Vol. 20. Academic Press, 
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N.Y.) of the proprietary genotypes CG00526 and CG00714 are initiated from immature 
embryos, 1.5 - 2.5 mm in length, from greenhouse grown material. Embryos are aseptically 
excised from surface-sterilized ears approximately 14 days after pollination. Embryos of 
CG00526 are placed on D callus initiation media with 2% sucrose and 5mg/L chloramben 
(Duncan etaL (1985) Planta 165: 322-332) while those of CG00714 are placed onto KM 
callus initiation media with 3% sucrose and 0.75mg/L 2,4-d (Kao and Michayluk (1 975) 
Planta 126, 105-1 10). Embryos and embryogenic cultures are subsequently cultured in the 
dark. Embryogenic responses are removed from the explants after --14 days. CG00526 
responses are placed onto D callus maintenance media with 2% sucrose and 0.5mg/L 2,4-d 
while those of CG00714 are placed onto KM callus maintenance media with 2% sucrose 
and 5mg/L Dicamba. After 3 to 8 weeks of weekly selective subculture to fresh 
maintenance media, high quality compact embryogenic cultures are established. Actively 
growing embryogenic callus pieces are selected as target tissue for gene delivery. The 
callus pieces are plated onto target plates containing maintenance medium with 12% 
-sucrose approximately 4 hours prior to gene delivery. The callus pieces are an-anged in 
' Circles, with radii of 8 and 10mm from the center of the target plate. Plasmid DNA is 
;f^precipitated onto gold microcarriers as described in the DuPont Biolistics manual. Two to 
' Hhree jxg.of each plasmid is used in each 6 shot microcarrier preparation. Genes are 
' delivered to the target tissue cells using the PDS-IOOOHe Biolistics device. The settings on 
i24heiiBiGlisticspdevice^are as follows: 8 mm between the rupture disc and the macrocarrier, 
^^^lO mm^between^he macrocamer and the stopping screen and 7 cm between the stopping 
^¥sereen^and the ^target; Each target plate is shot twice using 650psi rupture discs. A 200 X 
Jt200istainless.steel ^mesh (McMaster-Gan-, New Brunswick^ NJ) is placed between the 
?^topping:,sGreenrand the target tissue. 

Five days later, the bombed callus pieces are transfen-ed to maintenance medium 
• with 2% sucrose and 0.5mg/L 2,4-d, but without amino acids, and containing 750 or 1000 
nM Fomiula XVll. The callus pieces are placed for 1 hour on the light shelf 4-5 hours after 
transfer or on the next day, and stored in the dark at 27°C for 5-6 weeks. Following the 5-6 
week primary selection stage, yellow to white tissue is transferred to fresh plates containing 
the same medium supplemented with 500 or 750 nM Fomnula XVll. 4-5 hours aftier transfer 
or on the next day, the tissues are placed for 1 hour on the light shelf and stored in the dark 
at 27°C for 3-4 weeks. Following the 3-4 week secondary selection stage, the tissues are 
transferred to plates containing the same medium supplemented with 500 nM Formula XVll. 
Healthy growing tissue is placed for 1 hour on the light shelf and stored in the dark at 27°C. 
It is subcultured every two weeks until the colonies are large enough for regeneration. 
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At that point, colonies are transferred to a modified MS medium (Murashige and 
Skoog (1962) Physiol. Plant 15: 473-497) containing 3% sucrose (MS3S) with no selection 
agent and placed in the light. For CG00526, 0.25mg/L ancymidol and 0.5mg/L kinetin are 
added to this medium to Induce embryo germination, while for CG00714. 2mg/L benzyl 
adenine is added. Regenerating colonies are transferred to MS3S media without ancymidol 
and kinetin, or benzyl adenine, for CG00526 or CG00714, respectively, after 2 weeks. 
Regenerating shoots with or without roots are transferred to boxes containing MS3S 
medium and small plants with roots are eventually recovered and transferred to soil in the 
greenhouse. 

Example 46: Determination of the NHa-terminus of the mature maize 

protoporphyrinogen oxidase 

Maize seedlings are grown 6 days at 27°C in the dark and illuminated 30 min prior to 
recovering etioplasts. Leaves are homogenised in extraction buffer (20 mM TES, 10 mM 
Hepes, 0.5 M sucrose, 1 mM MgCIa, 1 mM EDTA, 0.2% BSA, pH 7.7, ±5 mM DTT) and 
filtered through cheesecloth layers. After centrifugation (7 min x lOOOOg) the pellet is 
resuspended in the same buffer and centrifuged 4 min x 150g. The supernatant is 
centrifuged 10 min x SOOOg, and the pellet resuspended in a small amount of buffer. 

For immunoprecipitation, the chloroplast fraction is diluted 5 fold in 20mM K- 
phosphate buffer, pH 7.0, 0.2 % Triton X-100, put on ice for 30 min and sonicated (3x6 
sec) before being mixed with Protein A Sepharose GL-4B (Pharmacia), The supernatant is 
recovered by centrifugation and incubated with Protein A Sepharose CL-4B coupled to 
purified anti- protoporphyrinogen oxidase antibodies (anti- protox-1). The resin is washed 
with 20 mM Na-Phosphate buffer, pH 7.0, and specifically bound proteins are eluted by 
boiling in 3% SDS. Detergent is precipitated at 4°C, the supernatant lyophilised by speed- 
vacuum and resuspended in 20 mM Na-Phosphate buffer, pH 7.0. 

Proteins are separated by SDS-electrophoresis according to Laemmli (1 970) Nature 
(London) 227, 680-685, and silver stained using a Biorad (Hercules, CA) kit. The proteins 
separated by SDS-electrophoresis are blotted onto nitrocellulose (Millipore). Purified anti- 
protoporphyrinogen oxidase antibodies are diluted 1:15,000 before use. Anti rabbit IgG 
alkaline phosphatase conjugates (Sigma) are used as secondary antibodies and 
protoporphyrinogen oxidase bands are stained by using a nitroblue tetrazoliunn/ 5-bromo-4- 
chloro-3-indolyl phosphate kit (Biorad). 

Protein samples for NH2-sequencing are prepared as previously described (de 
Marco, A., Guzzardi, P., & Jamet, E. (1999) Plant Physiol. 120, 371-381), and Edman 
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analysis is carried out in a model 477A protein sequencer (Applied Biosystems Inc., Foster 
City, CA) according to the recommendations of the manufacturer. The NH2-tenninus of the 
mature maize protoporphyrinogen oxidase (protox-1) is found to be located at the alanine at 
position 1 in the amino acid sequence set forth in SEQ ID NO:45. 

Purified recombinant protoporphyrinogen oxidase (see e.g. this application) is used 
to raise antibodies in rabbit using methods well known in the art. The antibodies are 
purified from sera using EMDTA Fractogel (Merck). 

Example 47: Recombinant expression of the mature maize protoporphyrinogen oxidase 

A, Construction of expression vectors and cell transformation. 

The pET16b and pET28 bacterial expression vector (Novagen, Madison, USA), 
based on the T7 promoter driven system and containing a 5' His-tag is used. The coding 
sequence of the mature maize protox-1 is amplified by PGR. The PGR product is cloned 
ihto the BamHI site of pET16b. A plasmid containing the connect orientation of the insert is 
•identified by sequencing and used for transfomnation of competent BL21 (DE3) and BL21 
'(DL3)pLysS cells. 

B. ;ReGombinant:proteinj isolation. , . ^ 

' Ei; CO// cells; !transformed with*the vector hosting the mature maize 
protoporphyrinogen oxidase coding sequence, are cultured 16 h at SS^C under constant 
.^agitatiorif Isopropyl-pr-D-thiogalactopyr (IPTG).is^ added to the medium 2 h before 

-viibacterial-recovery. Alternative growth conditions are also used to increase the accumulation 
'^-of^^active protein: Rrst, 'indue heat-shock proteins is stimulated by culturing the 
-^b^Gteria:1nitiallyjat*30?G^and then»increasing to 42°ain'the presence of 3% ethanol. 
'.^Second, the uptake of the compatible solute glycyl betaine is perfomned using sorbitol as an 
osmotic. Bacterial cells are recovered by centrifugation (15 min x 7500g) and stored at - 
80^C. The bacterial cells are resuspended in extraction buffer (40 mM Na-Phosphate, pH 
7.2, 300 mM NaCI, 20% glycerol. 0.2% Triton X-100, 20 mM imidazole), sonicated (6x6 
sec), and then centrifuged (25 min x lOOOOOg). The supernatant Is mixed with 2 mL of pre- 
washed immobilized nickel affinity chromatography resin (Talon, Clontech Lab.) in a 50 mL 
Falcon tube and gently rocked for 10 min, at 4°C. After centrifugation (2 min x 700 rpm), the 
supernatant is removed and the resin washed in extraction buffer (2X). Different protocols 
are used to complete the protein purification and described below. 

Method 1): The resin is used to load a column and subsequently washed with 10 
volumes of washing buffer (50 mM EPPS, pH 7.2, 20 % glycerol, 0.1 % TritonX -1 00). Bound 
proteins are eluted with the washing buffer containing 100 mM EDTA, desalted and buffer 
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exchanged (50 mM EPFS, pH 8.5, 1 mM EDTA) using PD-10 Sephadex G-25 columns 
(Pharmacia), and loaded onto an ion-exchange Mono-Q FPLC column (Pharmacia). 
Proteins are eluted using a 0-1 M NaCI gradient. Fractions corresponding to the different 
proteins are collected separately, concentrated by speed-vacuum and dialysed against 
washing buffer. Altematively, fractions recovered by affinity chromatography are buffer 
exchanged with 20 mM EPFS, pH 7.2, 150 m!^ NaCI and are loaded (- 150 ^iL 

corresponding to 150 ng protein) onto a pre-equilibrated gel-filtration Superdex 75 column 
(Pharmacia). 

Method 2): Proteins are eluted as in Method 1), buffer exchanged against washing 
buffer, loaded onto a column, washed twice in ATP buffer (50 mM EPFS, pH 7.2, 20 % 
glycerol, 5 mM ATP, 7 mM MgCI2, 0.1% TritonX -100) and eluted with imidazole buffer (50 
mM EPFS, pH 7.2, 20% glycerol, 0.1% TritonX -100, 1 M imidazole). 

Method 3): After the initial extraction buffer washing steps, the metal affinity resin is 
resuspended in ATP buffer and agitated for 30 min at 4 °C. The resin is recovered by 
centrifugation and washed again in the ATP buffer before being loaded onto a column and 
re-equilibrated with 50 mM EPFS, pH 7.2, 20% glycerol, 0.1% TritdnX -100. Protein is 
eluted with imidazole buffer (described above). In some extractions, phospholipids (Sigma, 
St. Louis) were added to the elution buffer (L-a-phosphatidylcholine; L-a- 
phosphatidylcholine, dioleoyi; L-a-phosphatidyl-ethanolamine; L-a-phosphatidyl-L-serine; L- 
a-phosphatidylinositol; L-a-lysophosphatidylinositol; L-a-lysophosphatidylcholine; L-a- 
iysophosphatidyl-L-serine; L-a-lysophosphatidylethanolamine). 

Resuspension of heat denaturated protein and of protein from homogenate pellet is 
performed using extraction buffer or re-equilibration buffer containing either phospholipides 
or chaperones. Proteins not immediately used for enzymatic tests are stored at -80 "C. The 
His-tag is removed by incubation in the presence of Factor Xa (New England Biolabs), 
according to manufacturer's instructions. 



Table 1A 

Alignment of the full-length and partial protox-1 amino acid sequences from 
Arabidopsis ("Arabpt-1"; SEQ ID N0:2), maize ("Mzpt-1"; SEQ ID NO:6), wheat ("Wtpt-1"; 
SEQ ID NO:10), soybean ("Soybeanpt-1"; SEQ ID N0:12). cotton ("Cottonpt-1 "; SEQ ID 
NO:16), sugar beet ("Sugpt-I"; SEQ ID N0:18), oilseed rape ("Rapept-I"; SEQ ID NO:20), 
rice ("Ricept-I"; SEQ ID NO:22), sorghum ("Sorghumpt-I"; SEQ ID NO:24), and sugarcane 
("Scpt-1"; SEQ ID NO:37). Alignment was performed using the PileUp program (GCG 
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package, University of Wisconsin, Madison, Wl). Positions that may be modified according 
to the teachings herein to confer or enhance inhibitor resistance are shown in bold type. 



1 .50 

Rapept-1 MDLSLLRP. . QPFLSPFSNP FPRSRPYKPL 

Arabpt-1 MELSLLRPTT QSLLPSFSKP NLRLNVYKPL 

Sorghumpt-l 

Mzpt-1 

Wtpt-1 M ATATVAAASP LRGRVTGRPH 

Ricept-1 

Cottonpt-1 MTAL IDLSLLRSSP SVSPFSIPHH QHPPRFRKPF 

Soybeanptl MV SVFNEILFPP NQTLLRPSLH SPTSFFTSPT RKFPRSRPNP 

Sugpt-1 MKSMALSNCI PQTQCMPLRS SGHYRGNCIM LSIPCSLIGR RGYYSHKKRR 

Scpt-1 

51 100 

Rapept-1 NLRCSVSGGS WGSSTIEGG GGGKTVTADC VIVGGGISGL CIAQALVTKH 

Arabpt-1 RLRCSVAGGP TVGSSKIEGG GGT.TITTDC VIVGGGISGL CIAQALATKH 

Sorghuitipt-1 

Mzpt-1 ADC VWGGGISGL CTAQALATRH 

Wtpt-1 RVRPRCATAS SATETPAAPG VRL . . . SAEC VIVGAGISGL CTAQALATRY 

Ricept-1 

Cottonpt-1 KLRCSLAEGP TISSSKIDGG ESS ... lADC VIVGGGISGL CIAQALATKH 

■Soybeanptl ILRCSIAEES TASPPKTR. . DSA. . .PVDC WVGGGVSGL CIAQALATKH. 

^Sugpt-l MSMSCiSTSSG SKSAVKEAGS GSGAGGLLDC VIVGGGISGL CIAQALCTKH 

'Scpt-1 .-^ Vv';'t'-i'i- . . . . . . 
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101 

Rapept-1 PDA..AKNVM VTEAKDRVGG NUT. .REEQ GFLWEEGPKS FQPSDPMLTM 

Arabpt-1 PDA. .APNLI VTEAKDRVGG NUT. .REEN GFLWEEGPNS FQPSDPMLTM 

Sorghumpt-l STVERPEE GYLWEEGPNS FQPSDPVLSM 

Mzpt-1 ..G..VGDVL VTEARARPGG NITTVERPEE GYLWEEGPNS FQPSDPVLTM 

_Wtpt-l ..G..VSDLL VTEARDRPGG NITTVERPDE GYLWEEGPNS FQPSDPVLTM 

Ricept-1 

Cottonpt-1 RDV. .ASNVI VTEARDRVGG NITTVER.' '.O GYLWEEGPNS FQPSDPILTM 

Soybeanptl . .A. .NANW VTEARDRVGG NITTMER. .D GYLWEEGPNS FQPSDPMLTM 

Sugpt-1 SSSSLSPNFI VTEAKDRVGG NIVTVE. .AD GYIWEEGPNS FQPSDAVLTM 



Rapept-1 
Arabpt-1 
Sorghximpt-l 
Mzpt-1 
Wtpt-1 
Ricept-1 
Cottonpt-1 
Soybeanptl 
Sugpt-1 
Scpt-1 



Rapept-1 
Arabpt-1 
Sorgh\impt-l 
Mzpt-l 
Wtpt-1 
Ricept-1 
Cottonpt-1 
Soybeanptl 
Sugpt-1 
Scpt-1 



Rapept-1 
Arabpt-1 
Sorghumpt-l 
Mzpt-1 
Wtpt-1 
Ricept-1 
Cottonpt-1 
Soybeanptl 
Sugpt-1 
Scpt-1 



151 

WDSGLKDDL 
WDSGLKDDL 
AVDSGLKDDL 
AVDSGLKDDL 
AVDSGLKDDL 



VLGDPTAPRF 
VliGDPTAPRF 
VPGDPNAPRF 
VPGDPNAPRF 
VPGDPNAPRF 



VLWNGKLRPV 
VLWNGKLRPV 
VLWEGKLRPV 
VLWEGKLRPV 
VLWEGKLRPV 



PSKLTDLPFF 
PSKLTDLPFF 
PSKPADLPFF 
PSKPADLPFF 
PSKPGDLPFF 



200 

DLMSIGGKIR 
DLMSIGGKIR 
DLMSIPGKLR 
DLMSIPGKLR 
SLMSIPGKLR 



AVDSGLKDDL VLGDPNAPRF VLWEGKLRPV PSKPTDLPFF DLMSIAGKLR 
WDSGLKDEL VLGDPDAPRF VLWNRKLRPV PGKLTDLPFF DLMSIGGKIR 
AVDSGLKDEL VLGDPNAPRF VLWNDKLRPV PSSLTDLPFF DLMTIPGKIR 



201 

AGFGAIGIRP 
AGFGALGIRP 
AGLGALGIRP 
AGLGALGIRP 
AGLGALGIRP 



SPPGREESVE 
SPPGREESVE 
PAPGREESVE 
PPPGREESVE 
PPPGREESVE 



EFVRRNLGDE 
EFVRRNLGDE 
EFVRRNLGAE 
EFVRRNLGAE 
EFVRRNLGAE 



VFERLIEPFC 
VFERLIEPFC 
VFERLIEPFC 
VFERLIEPFC 
VFERLIEPFC 



250 

SGVYAGDPAK 
SGVYAGDPSK 
SGVYAGDPSK 
SGVYAGDPSK 
SGVYAGDPSK 



AGFGAIGIRP PPPGYEESVE EFVRRNLGAE VFERFIEPFC SGVYAGDPSK 
AGFGALGIRP PPPGHEESVE EFVRRNLGDE VFERLIEPFC SGVYAGDPSK 
AALGALGFRP SPPPHEESVE HFVRRNLGDE VFERLIEPFC SGVYAGDPAK 



251 

LSMKAAFGKV 
LSMKAAPGKV 
LSMKAAFGKV 
LSMKAAFGKV 
LSMKAAFGKV 
RALKAAFGKV 
LSMKAAFGRV 
LSMKAAFGKV 
LSMKAAFGKV 



WKLEENGGSI 
WKLEQNGGSI 
WRLEEAGGSI 
WRLEETGGSI 
WRLEEIGGSI 
WRLEDTGGSI 
WKLEEIGGSI 
WKLEKNGGSI 
WKLEQKGGSI 



IGGAFKAIQA 
IGGTFKAIQE 
IGGTIKTIQE 
IGGTIKTIQE 
IGGTIKAIQD 
IGGTIKTIQE 
IGGTFKTIQE 
IGGTFKAIQE 
IGGTLKAIQE 



KNKAPKTTRD 
RKNAPKAERD 
RGKNPKPPRD 
RSKNPKPPRD 
KGKNPKPPRD 
RGKNPKPPRD 
RNKTPKPPRD 
RNGASKPPRD 
RGSNPKPPRD 



300 

PRLPKPKGQT 
PRLPKPQGQT 
PRLPKPKGQT 
ARLPKPKGQT 
PRLPAPKGQT 
PRLPTPKGQT 
PRLPKPKGQT 
PRLPKPKGQT 
QRLPKPKGQT 
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Rapept-1 
Arabpt-1 
Sorghunrpt-1 
Mzpt-1 
Wtpt-1 
Ricept-1 
Cottonpt-1 
Soybeanptl 
Sugpt-1 
Scpt-1 



Rapept-1 
Arabpt-1 
Sorghunipt-1 
Mzpt-1 
Wtpt-1 
Ricept-1 
Cottonpt-1 
Soybeanptl 
^-Sugpt-1 
=;ScptT-l 



301 

VGSFRKGLTM 
VGSFRKGLRM 
VASFRKGIiAM 
VASFRKGLAM 
VASFRKGLAM 
VASFRKGLTM 
VGSFRKGLTM 
VGSFRKGLTM 
VGSFRKGLVM 



LPEAISARLG 
LPEAISARLG 
LPNAITSSLG 
LPNAITSSLG 
LPNAIASRLG 
LPDAITSRLG 
LPEAIANSLG 
LPDAISARLG 
LPTAISARLG 



DKVKVSWKLS 
SKVKLSWKLS 
SKVKLSWKLT 
SKVKLSWKLT 
SKVKLSWKLT 
SKVKLSWKLT 
SNVKLSWKLS 
NKVKLSWKLS 
SRVKLSWTLS 



SITKLASGEY 
GITKLESGGY 
SMTKSDGKGY 
SITKSDDKGY 
SITKADNQGY 
SITKSDNKGY 
SITKLGNGGY 
SISKLDSGEY 
SIVKSLNGEY 



350 

SLTYETPEGI 
NLTYETPDGL 
VLEYETPEGV 
VLEYETPEGV 
VLGYETPEGL 
ALVYETPEGV 
NLTFETPEGM 
SLTYETPEGV 
SLTYDTPDGL 



351 

VTVQSKSA/VM 
VSVQSKSWM 
VLVQAKSVIM 
VSVQAKSVIM 
VSVQAKSVIM 
VSVQAKTWM 
VSLQSRSWM 
VSLQCKTWL 
VSVRTKSWM 



TVPSHVASSL 
TVPSHVASGL 
TIPSYVASDI 
TIPSYVASNI 
TIPSYVASDI 
TIPSYVASDI 
TIPSHVASNL 
TIPSYVASTL 
TVPSYVASRL 



LRPLSDSAAE 

LRPLSESAAN 

LRPLSGDAAD 

LRPLSSDAAD 

LRPLSIDAAD 

LRPLSSDAAD- 

LHPLSAAAAD 

LRPLSAAAAD 

LRPLSDSAAD 



ALSKLYYPPV 
ALSKLYYPPV 
VLSRFYYPPV 
ALSRFYYPPV 
ALSKFYYPPV 
ALSIFYYPPV 
ALSQFYYPPV 
ALSKFYYPPV 
SLSKFYYPPV 



400 

AAVSISYAKE 
AAVSISYPKE 
AAVTVSYPKE 
AAVTVSYPKE 
AAVTVSYPKE 
AAVTVSYPKE 
ASVTVSYPKE 
AAVSISYPKE 
AAVSLSYPKE 



-t1 

->Arabpt^l 
Sorghuinpt^l 
TMzpt^l 
-Wtpt-1 
*^Rd^G,jeptr-l 
Cottbnptr-1 
-Soybeanptl 
■^'^SAigpt-r-l 
i^l 



401 

.xf .^^IBRS EGLI*I)Gii^ELKGFGQLHP - RTQKVETLGT 
--'•■'ii€'R:TtEGL IDG?HELKG^^ RTQGVETLGT 
^^'ip-RaC'ECI/^^^ ' RSQGVETLGT 

.%'rA3>RKECLI-DG^^ELQGFiGQLHP RSQGVETLGT 

/AIRKEGLIDG ^'ELQGFGQLHP RSQGVETLGT 
^^A$RKEGLiBDG:'*ELQG^:.G^^^^ RSQGVETLGT 
AIRKECLIDG ^ELKGFGQLHP • RSQGIETLGT 

t A-IRSEGLIDG riELKGFGQLHP vRSQGVETLGT 
- AIRSECL3fNGi6ELQGFGQLHP. RSQGVETLGT 



lYSSSLFPNR 
lYSSSLFPNR 
lYSSSLFPNR 
IYSSSLFP3TO 
lYSSSLFPNR 
lYSSSLFPNR 
lYSSSLFPNR 
lYSSSLFPNR 
lYSSSLFPGR 



450 

APPGRVLLLN 
APPGRILLLN 
APAGRVLLLN 
APDGRVLLLN- 
APAGRVLLLN 
APAGRVLLLN 
APSGRVLLLN- 
APPGRVLLLN 
APPGRILILS 



^i-^Rapeptrl 
Arabptrl 
Sorghumpt-1 
Mzpt-1 
Wtpt-1 
Ricept-1 
Cottonpt-1 
Soybeanptl 
Sugpt-1 
Scpt-1 



^451 

YIGGATNTGI 
YIGGSTNTGI 
YIGGATNTGI 
YIGGATNTGI 
YIGGSTNTGI 
YIGGSTNTGI 
YIGGATNTGI 
YIGGATNTGI 
YIGGAKNPGI 



LSKSEGELVE 
LSKSEGELVE 
VSKTESELVE 
VSKTESELVE 
VSKTESDLVG 
VSKTESELVE 
LSKTEGELVE 
LSKTDSELVE 
LNKSKDELAK 
. SKTESELVE 



AVDRDLRKML 
AVDRDLRKML 
AVDRDLRKML 
AVDRDLRKML 
AVDRDLRKML 
AVDRDLRKML 
AVDRDLRKML 
TVDRDLRKIL 
TVDKDLRRML 
AVDRDLRKML 



IKPSSTDPLV 
IKPNSTDPLK 
INPTAVDPLV 
INSTAVDPLV 
INPRAADPLA 
INPRAVDPLV 
INPNAKDPLV 
INPNAQDPFV 
INPDAKLPRV 
INPTAVDPLV 



500 

LGVKLWPQAI 
LGVRVWPQAI 
LGVRVWPQAI 
LGVRVWPQAI 
LGVRVWPQAI 
LGVRVWPQAI 
LGVRVWPKAI 
VGVRLWPQAI 
LGVRVWPQAI 
LGVRVWPQAI 
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501 

Rapept-1 PQFLIGHIDL VDAAKASLSS 

Arabpt-1 PQFLVGHFDI LDTAKSSLTS 

Sorghumpt-l PQFLVGHLDL LEAAKSALDQ 

Mzpt-l PQFLVGHLDL LEAAKAALDR 

Wtpt-1 PQFLIGHLDR LAAAKSALGQ 

Ricept-1 PQFLIGHLDH LEAAKSALGK 

Cottonpt-1 PQFLVGHLDL LDSAKMALRD 

Soybeanptl PQFLVGHLDL LDVAKASIRN 

Sugpt-1 PQFSIGHFDL LDAAKAALTD 

Scpt-1 PQFLVGHLDL LEAAKSALDR 



550 

SGHEGLFLGG NYVAGVALGR CVEGAYETAT 
SGYEGLFLGG NYVAGVALGR CVEGAYETAI 
GGYNGLFLGG NYVAGVALGR CIEGAYESAA 
GGYDGLFLGG NYVAGVALGR CVEGAYESAS 
GGYDGLFLGG KYVAGVALGR CIEGAYESAS 
GGYDGLFLGG NYVAGVALGR CVEGAYESAS 
SGFHGLFLGG NYVSGVALGR CVEGAYEVAA 
TGFEGLFLGG NYVSGVALGR CVEGAYEVAA 
TGVKGLFLGG NYVSGVALGR CIEGAYESAA 
GGYDGLFLGG NYVAGVALGR CVEGAYESAS 



551 563 

Rapept-1 QVNDFMSRYA YK* 

Arabpt-1 EVNNFMSRYA YK* 

Sorghiimpt-l QIYDFLTKYA YK* 

Mzpt-l QISDFLTKYA YK* 

Wtpt-1 QVSDFLTKYA YK* 

Ricept-1 QISDYLTKYA YK* 

Cottonpt-1 EVKEFLSQYA YK* 

Soybeanptl EVNDFLTNRV YK* 

Sugpt-1 EWDFLSQYS DK* 

Scpt-1 QIYDFLTKYA YK* 



Table 1 B 



-sequences of herbicide-tolerant protox enzymes comprising point mutations. 



Sub- 
sequence 
# 


Sub- 
sequence AA 
sequence 


An AA 

wild-type 


AnAA 

substitutions 


Corresponding 
AA position of 
An in Table 1 A 


Exemplary 
mutants 


1 


APAiF 


R 


C,L 


169 


MzSSCys 


2 


FA2S 








Mz88Leu 




"c 




240 


Mz159Phe 












l\/Iz159Leu 
IVIz159Lys 


3 


YA3G 


A 


V, L, C, 1 


245 


pAraC-1Val 
pAraC-IThr 
pAraC-1 Leu 
pAraC-1 Cys 
pAraC-1 lie 
pMzC-lVal 
pMzC-1Thr 
pMzC-1 Leu 
pWhtC-1Val 
pWhtC-1Thr 
pSoyC-1Thr 
pSovC-1 Leu 
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A 
H 


AA4D 


ri 


Q 1 
0, L 




pArao-ober 
























pMzC-3Leu 












pWhtC-3Ser 


5 


YAsP 


p 


S. H 


388 


Sov369Ser 












Sov369His 












Cot365Ser 


6 


PArA 


V 


L 


390 


Wht356Leu 


7 


A in 


Y 

1 


r: 1 1 T K/i 

1, u, 1 , IVI, 




pArau-iiuys 








\/ A H R 

V, rt, n, n 




pArau-/iiie 












pArau-iiLeu 












pArau-^ 1 nr 












pArao-iiMGt 












pArau-2vai 












pAraC-2Ala 












pMzC-2lle 












pMzc-2Met 












pooyo-2Leu 












pooyu-2iie 








^ i 




pougv-z-^uys 












pSugC-2Leu 












r*Qi inf^ Olio 
poUyO'iflllS 












UwLlUV^ £. V Cll 


.1-* 




• 










- 






^ * 1 


DCotC*2Cvs 






• 






nCntn-PHi<; 


- 


> ^ - ... . . ,\ 








pvoi^-iiAry 


8 : 


'■YiGGAn 


A, S 


P , 


455 


Wht4P1 Prn 

V V 1 1 l*T^ 1 1 t W 


y 


AA9P 


1 ' 
1 


T II /-S 1 ' 

T, n, G, N 


500 


Mz419Thr 












iviZ4i ynis 












[vlz4lyGly 




1 








Mz419Asn 












Wht466Thr 


AO 




;V 


A ' - 


*536 


Wht502Ala 












Sov51 7Ala 


18 


KAAisF 


A 


T,V 


256 


Mz175Thr 












Mz175VaI 


19 


QA19H 


L 


S 


418 


Mz337Ser 


Second-site mutations 


11 


QA11S 


P 


L 


143 


AraC118Leu 


12 


IGGA12 


T 


l,A 


274 


AraC249lle 












AraC249Ala 


13 


SWXLA13 


S.T 


L 


330 


AraC305Leu 


14 


LA14Y 


N 


S 


450 


AraC425Ser 


15 


GA15XGL 


Y, H, F, V 


C 


523 


AraC498Cys 
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Double mutatic 


in 


16 


TAisG 


L 


S 


428 


Mz347Ser- 
453Thr 


17 


YVAirG 


A, (S) 


T 


534 





Table 2 

Comparison of the Arabidopsis (SEQ ID NO:4) and maize (SEQ ID NO:8) protox-2 
amino acid sequences. Identical residues are denoted by the vertical bars between the two 
sequences. Alignment wasperfomied using the GAP program described in Deveraux etal., 
Nucleic Acids Res. 72:387-395 (1984). Percent similarity: 75.889 / percent identity: 57.905. 

MASGAVAD.HQIEAVSGKRVAV 21 

• I h h • : I • • : : • I I 
1 MLALTASASSASSHPYRHASAHTRRPRLRAVLAMAGSDDPRAAPARSVAV 50 

22 VGAGVSGLAAAYKLKSRGLNVTVFEADGRVGGKLRSVMQNGLIWDEGANT 71 

Mlllllilllhl: •|:|||||||.:|.|||:|. :.|::||||||| 
51 VGAGVSGLAAAYRLRQSGVNVTVFEAADRAGGKIRTNSEGGFVWDEGANT 100 

72 MTEAEPEVGSLLDDLGLREKQQFPISQKKRYIVRNGVPVMLPTNPIELVT 121 

101 MTEGEWEASRLIDDLGLQDKQQYPNSQHKRYIVKDGAPALIPSDPISLMK 150 

122 SSVLSTQSKFQILLEPFLWKK KSSKVSDASAEESVSEFFQRHFGQB 167 

= - I : | | | : • :|||| | 

151 SSVLSTKSKIALFFEPFLYKKANTRNSGKVSEEHLSESVGSFCERHFGRE 200 

168 VVpYLlDPFVGGTSAADPDSLSMKHSFPDLWNVEKSFGSIIVGAIRTKFA 217 

I II 1^ = 1111 = 111 hi II II .|:| 

201 VVDYFVDPFVAGTSAGDPESLSIRHAFPALWNLERKYGSVIVGAILSKLA 250 

218 AKGGKSRDTKSSPGTKKGSRGSFSFKGGMQILPDTLCKSLSHDEINLDSK 267 

lll=- =• ••l-l-:=-.|-lll|.|||| I :.| ..::.|::.l:.. 
251 AKGDPVKTRHDSSGKRRNRRVSFSFHGGMQSLINALHNEVGDDNVKLGTE 300 

268 VLSLS. .YNSGSRQENWSLSCVSHNETQRQ. . .NPHYDAVIMTAPLCNVK 312 

nil- = = = ■• =-lhl- |. |. : | | || | | | | | : | | : 
301 VLSLACTFDGVPALGRWSISVDSKDSGDKDLASNQTFDAVIMTAPLSNVR 350 

313 EMKVMKGGQPFQLNFLPEINYMPLSVLITTFTKEKVKRPLEGFGVLIPSK 362 

II- lll-l- l=lll-==l=lll===l-l-|:-l|:|||||||||| I 
351 RMKFTKGGAPWLDFLPKMDYLPLSLMVTAFKKDDVKKPLEGFGVLIPYK 400 

363 E.QKHGFKTLGTLFSSMMFPDRSPSDVHLYTTFIGGSRNQELAKASTDEL 411 
.n. I lll'=INIIIIIIIIIII|.|-| -11111:111:1:11 M- | 
401 EQQKHGLKTLGTLFSSMMFPDRAPDDQYLYTTFVGGSHNRDLAGAPTSIL 450 
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412 KQVVTSDLQRLLGVEGEPVSVNHYYWRKAFPLYDSSYDSVMEAIDKMEND 461 

Ihillll-lllllhl. M II .|IM|: .Mhllhlll.: 

451 KQLVTSDLKKXLGVEGQPTFVKHVYWGNAFPLYGHDYSSVLEAIEKMEKN 500 

462 LPGFFYAGNHRGGLSVGKSIASGCKAADLVISYLESCSNDKKPNDSL* 509 

I I 11 III M ::||-1|. lllhllllMIIIII : 

501 LPGFFYAGNSKDGLAVGSVIASGSKAADLAISYLESHTKHNNSH* . . . 545 



Table 3A 

Cross tolerance of plant protox mutants to various protox inhibitors. 



Formula 


AraC-1Val 


AraC-2Cvs 


1 

AraC-1Thr 


AraC-3Thr 


MzC-1Val 


XVII 


+ 


+ 


+ 


+ 


+ 


Vila 


+ 




+ 




+ 


IV 


++ 




++ 


++ 




XV 


+ 


+ 


+ 


+ 


.+ 


XI 




+ 


+ 


++ 


+ 


''XVI 










+ 


^Xll . 


+ 






++ 


++ 


txiv 


+ 






+ 




♦V 













^ .=t^1iOX*^or^nipre4olerantithan W 
H-+ = 1 0OX^or^rripre^olerant than WT 
4-^=.no cross 'tolerance 

"'^#ithis^e.Qpppund^^ provided no infonnation 



, Table 3B 

: *©ross tolerance, of plant protox mutants to^various protox inhibitors. 





AraC- 


AraC- 


AraC" 


AraC- 


AraC- 


AraC- 


AraC- 


AraC- 




1Leu 


2lle 


ILeu 


1Leu 


2l]e 


2Cys 


2Leu 


2Met 








+ 


+ 


+ 


+ 


+ 


+ 








AraC- 


AraC- 


AraC 


AraC 


AraC 


AraC 


Fonmula 






2Met 


2Leu 


SOSLeu 


425Ser 


425Ser 


425Ser 


XVII 


+ 


+ 


+ 


+ 


+ 


+ 


+ 


+ 


Vila 


++ 


++ 




++ 




++ 


++ 


++ 


IV 


++ 




+ 


-H- 


+ 




+ 


+ 


XV 


++ 


+++ 


+++ 




+++ 


++ 


1 1 1 

1 1 r 


++ 


XI 


++ 


++ 


++ 


++ 


1,, 1 


++ 


++ 


++ 


XVI 


+++ 


+++ 


+++ 


+-H- 


+++ 


+ 


++ 


++ 


XII 


















XIV 


n 


++ 


++ 


++ 






++ 


++ 
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Table 4 

Cross tolerance to various protox inhibitors in a seed germination assay. 



Formula 


Common name 


Tolerance 


II 


acifluorofen 


+ 


III 


fomasafen 


+ 


IV 


fluoroglycofen 


± 


IVb 


bifenox 


+ 


IVC 


oxyfluorofen 


+ 


IVd 


lactofen 


± 


Vila 


fluthiacet-methvl 


++ 


X 


sulfentrazone 


+ 


XI 


flupropazil 




XIV 


flumiclorac 

_ . .. 


+ 


XVI 


flumioxazin 


+++ 


XVII 




++ 


XXIa 


BAY 11 340 


+ 


XXII 




-H- 



± < 1 0X more tolerant than wt 
+ > 10X more tolerant than wt 
•H- > 100X more tolerant than wt 
+++ > 1000X more tolerant than wt 



Various modifications of the invention described herein will become apparent to 
those skilled in the art. Such modifications are intended to fall within the scope of the 
appended claims. 



115 



wo 01/68826 

What Is Claimed Is: 



PCT/EPOl/02753 



1. An isolated DNA molecule comprising a coding sequence that encodes the amino 
acid sequence set forth in SEQ ID NO:45. 

2. The DNA molecule according to claim 1 , wherein said DNA molecule comprises the 
coding sequence set forth in SEQ ID NO:44. 

3. A chimeric gene comprising a promoter active in a plant operatively linked to the 
DNA molecule according to claim 1 . 

4. The chimeric gene according to claim 3, additionally comprising a signal sequence 
operatively linked to said DNA molecule, wherein said signal sequence targets the amino 
acid sequence encoded by said coding sequence to a plastid. 

5. The chimeric gene according to claim 3, additionally comprising a signal sequence 
^>i^operativelyilinked.tO;Said DNA moleculev wherein said signal sequence targets the amino 
-iiaeid-sequences^encoded by said coding sequence to a mitochondria. 

6. ^^A>*Fecombinant vector comprising the chimeric gene according to claim 3. 
S J, A^host-ciBll^comprising the chimeric gene according to claim 3. 

^^^8. The^hGStcell according to claim 7,- wherein said host cell is selected from the group 
-r;cpnsisting of a plant cell, a bacterial ceil, a yeast cell, and an insect cell. 

9. The host cell according to claim 8, wherein said host cell is a plant cell. 

10. A chimeric gene comprising a promoter functional in a plant plastid operatively linked 
to the DNA molecule according to claim 1 . 

11. The chimeric gene according to claim 10, wherein said plant plastid promoter is a 
c/pP gene promoter. 
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12. The chimeric gene according to claim 10, further comprising a 5' untranslated 
sequence (5'UTR) from said plastid promoter and a plastid gene 3' untranslated sequence 
(3' UTR) operatively linked to said DNA molecule. 

13. The chimeric gene according to claim 12, wherein said plant plastid promoter is a 
c/pP gene promoter, and wherein said 3' UTR is a plastid rps^6 gene 3' untranslated 
sequence. 

14. A plastid transformation vector comprising the chimeric gene according to claim 12. 

15. A plant plastid comprising the plastid transformation vector according to claim 14. 

1 6. A plant comprising the plant cell according to claim 9. 

1 7. Seed from the plant according to claim 1 6. 

1 8. A plant cell comprising the plant plastid according to claim 1 5. 

1 9. A plant comprising the plant cell according to claim 18. 

20. Seed from the plant according to claim 1 9. 

21 . An isolated polypeptide comprising the amino acid sequence set forth in SEQ ID NO: 45, 
wherein said polypeptide has protoporphyrinogen oxidase (protox) activity. 

22. A method comprising: 

(a) obtaining a host cell comprising a DNA molecule according to claim 1 ; and 

(b) expressing a polypeptide comprising the amino acid sequence set forth in SEQ 
ID NO:45 in said host cell. 

23. The method according to claim 22, wherein said host cell is a bacterial cell, a yeast cell 
or an insect cell. 

24. The method according to claim 23, wherein said host cell is E colL 
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25. A method for assaying a chemical for the ability to inhibit the activity of the polypeptide 
according to claim 21 comprising: 

(a) combining said polypeptide and protoporphyrinogen IX in a first reaction mixture 
under conditions in which said polypeptide catalyzes the conversion of said 
protoporphyrinogen IX to protoporphyrin IX; 

(b) combining a chemical, said polypeptide and protoporphyrinogen IX in a second 
reaction mixture under the conditions of step (a); 

(c) measuring the fluorescence emission of said first and said second reaction 
mixtures at about 622 to about 635 nM after excitation of said first and said second reaction 
mixtures at about 395 to about 410 nM; 

(d) comparing the fluorescence emission of said first and said second reaction 
mixtures at about 622 to about 635 nM; 

(e) selecting a chemical that inhibits the activity of said polypeptide if the 
fluorescence emission of said second reaction mixture is less than the fluorescence 
emission of said first reaction mixture and if the decrease in fluorescence emission 
between said 'first and second reaction mixtures is larger than the margin of error inherent in 

*-^the-'measurement5of^said^fluorescence' emission. * 



118 



wo 01/68826 



PCT/EPOl/02753 



SEQUENCE LISTING 
<110> Syngenta Participations AG 

<120> PROTOPORPHYRINOGEN OXIDASE ("PROTOX") GENES 
<X30> PB/5-31284A 

<140> 
<141> 

<150> US 09/525,152 1 
<151> 2000-03-14 i . .__ 

<160> 47 

<170> Patentin Ver. 2.1 

<210> 1 
<211> 1719 
<212> DNA 

<213> Arabidopsis tha liana 

<220> 
<221> CDS 

<222> (31) . . (1644) 

<223> Arabidopsis protox-1 

<400> 1 

tgacaaaatt ccgaattctc tgcgatttcc atg gag tta tct ctt etc cgt ccg 54 

Met Glu Leu Ser Leu Leu Arg Pro 
1 5 

acg act caa teg ctt ctt ccg teg ttt teg aag cec aat etc cga tta 102 
Thr Thr Gin Ser Leu Leu Pro Ser Phe Ser Lys Pro Asn Leu Arg Leu 
10 15 20 

aat gtt tat aag cct ctt aga etc cgt tgt tea gtg gee ggt gga eca 150 
Asn Val Tyr Lys Pro Leu Arg Leu Arg Cys Ser Val Ala Gly Gly Pro 
25 30 35 40 

ace gte gga tct tea aaa ate gaa gge gga gga ggc ace acc ate acg 198 
Thr Val Gly Ser Ser Lys He Glu Gly Gly Gly Gly Thr Thr He Thr 

45 50 55 

acg gat tgt gtg att gtc ggc gga ggt att agt ggt ctt tgc ate get 246 
Thr Asp Cys Val He Val Gly Gly Gly He Ser Gly Leu Cys He Ala 

60 65 70 

cag geg ctt get act aag eat cct gat get get ccg aat tta att gtg 294 
Gin Ala Leu Ala Thr Lys His Pro Asp Ala Ala Pro Asn Leu He Val 
75 80 85 

ace gag get aag gat cgt gtt gga ggc aac att ate act cgt gaa gag 342 
Thr Glu Ala Lys Asp Arg Val Gly Gly Asn He He Thr Arg Glu Glu 
90 95 100 



1 



wo 01/68826 



PCT/EPOl/02753 



aat ggt ttt etc tgg gaa gaa ggt ccc aat agt ttt caa ccg tct gat 390 
Asn Gly Phe Leu Trp Glu Glu Gly Pro Asn Ser Phe Gin Pro Ser Asp 
105 110 115 120 

cct atg etc act atg gtg gta gat agt ggt ttg aag gat gat ttg gtg 438 
Pro Met Leu Thr Met Val Val Asp Ser Gly Leu Lys Asp Asp Leu Val 

125 130 135 

ttg gga gat cct act gcg cca agg ttt gtg ttg tgg aat ggg aaa ttg 486 
Leu Gly Asp Pro Thr Ala Pro Arg Phe Val Leu Trp Asn Gly Lys Leu 

140 145 150 

agg ccg gtt cca teg aag eta aca gac tta ccg ttc ttt gat ttg atg 534 
Arg Pro Val Pro Ser Lys Leu Thr Asp Leu Pro Phe Phe Asp Leu Met 
155 160 165 

agt att ggt ggg aag att aga get ggt ttt ggt gea ctt ggc att cga 582 
Ser lie Gly Gly Lys lie Arg Ala Gly Phe Gly Ala Leu Gly He Arg 
170 175 180 

ccg tea cct cca ggt cgt gaa gaa tct gtg- gag gag ttt gta egg cgt 630 
vpro'^i'Ser Pro-'Pro Gly Arg Glu Glu -Ser Val Glu Glu Phe Val Arg Arg 

190 195 200 



;$4*aao*etei;^:ggtr^^gat'^igag-gtt ttt ^gag -^cgc ctg att gaa ccg ttt tgt tea 678 

c-"'Asn>Leu^Gly^Asp.;Glu -Val Phe.Glu rArg Leu He Glu Pro Phe Cys Ser 

205 210 215 

*-.ggtv-gtt ■ tat get ggt : gat cct tea aaa ctg age atg aaa gea gcg ttt 726 

■:^|[Glir^Val^Tyr^^Ma *G1^ Leu Sier Met Lys Ala Ala Phe 

"220 " 225 230 

•t^^ggg^^aiag gtt ^tgg aaa eta gag caa aat ggt gga age ata ata ggt ggt 774 

^v:GlyMLys vVal'^-Trp Lys Leu Glu Gin Asn Gly Gly Ser He He Gly Gly 

235 240 .245 

-•-actt^ttt^aag-gea att cag gag- aggvaaa aac get ccc aag gea gaa cga 822 

Thr^Phe Lys Ala He Gin Glu Arg. Lys Asn Ala Pro Lys Ala Glu Arg 

-250 255 260 

gac ccg cgc ctg cca aaa cca cag ggc caa aca gtt ggt tct ttc agg 870 

Asp Pro Arg Leu Pro Lys Pro Gin Gly Gin Thr Val Gly Ser Phe Arg 

265 270 275 280 

aag gga ctt cga atg ttg cca gaa gea ata tct gea aga tta ggt age 918 

Lys Gly Leu Arg Met Leu Pro Glu Ala He Ser Ala Arg Leu Gly Ser 

285 290 295 

aaa gtt aag ttg tct tgg aag etc tea ggt ate act aag ctg gag age 966 

Lys Val Lys Leu Ser Trp Lys Leu Ser Gly He Thr Lys Leu Glu Ser 

300 305 310 

gga gga tac aac tta aca tat gag act cca gat ggt tta gtt tec gtg 1014 

Gly Gly Tyr Asn Leu Thr Tyr Glu Thr Pro Asp Gly Leu Val Ser Val 

315 320 325 
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cag age aaa agt gtt gta atg acg gtg cca tct cat gtt gca agt ggt 1062 
Gin Ser Lys Ser Val Val Met Thr Val Pro Ser His Val Ala Ser Gly 
330 335 340 

etc ttg egc ect ctt tct gaa tct get gea aat gca etc tea aaa eta 1110 
Leu Leu Arg Pro Leu Ser Glu Ser Ala Ala Asn Ala Leu Ser Lys Leu 
345 350 355 360 

tat tac cca cca gtt gea gea gta tct ate teg tae ccg aaa gaa gca 1158 
Tyr Tyr Pro Pro Val Ala Ala Val Ser lie Ser Tyr Pro Lys Glu Ala 

365 370 375 

ate cga aca gaa tgt ttg ata gat ggt gaa eta aag ggt ttt ggg eaa 1206 
lie Arg Thr Glu Cys Leu He Asp Gly Glu Leu Lys Gly Phe Gly Gin 

380 385 390 

ttg eat cca egc acg eaa gga gtt gaa aca tta gga act ate tac age 1254 
Leu His Pro Arg Thr Gin Gly Val Glu Thr Leu Gly Thr He Tyr Ser 
395 400 405 

tec tea etc ttt cca aat egc gca ccg cec gga aga att ttg ctg ttg 1302 
Ser Ser Leu Phe Pro Asn Arg Ala Pro Pro Gly Arg He Leu Leu Leu 
410 415 420 

aac tac att ggc ggg tct aca aac ace gga att ctg tec aag tct gaa 1350 
Asn Tyr He Gly Gly Ser Thr Asn Thr Gly He Leu Ser Lys Ser Glu 
425 430 435 440 

ggt gag tta gtg gaa gea gtt gae aga gat ttg agg aaa atg eta att 1398 
Gly Glu Leu Val Glu Ala Val Asp Arg Asp Leu Arg Lys Met Leu He 

445 450 455 

aag cct aat teg ace gat cca ctt aaa tta gga gtt agg gta tgg cct 1446 
Lys Pro Asn Ser Thr Asp Pro Leu Lys Leu Gly Val Arg Val Trp Pro 

460 465 470 

eaa gee att cct cag ttt eta gtt ggt cac ttt gat ate ctt gae acg 1494 
Gin Ala He Pro Gin Phe Leu Val Gly His Phe Asp He Leu Asp Thr 
475 480 485 

get aaa tea tct eta acg tct teg ggc tac gaa ggg eta ttt ttg ggt 1542 
Ala Lys Ser Ser Leu Thr Ser Ser Gly Tyr Glu Gly Leu Phe Leu Gly 
490 495 500 

ggc aat tac gtc get ggt gta gee tta ggc egg tgt gta gaa ggc gca 1590 
Gly Asn Tyr Val Ala Gly Val Ala Leu Gly Arg Cys Val Glu Gly Ala 

510 515 520 

tat gaa acc gcg att gag gtc aac aac ttc atg tea egg tac get tac 1638 
Tyr Glu Thr Ala He Glu Val Asn Asn Phe Met Ser Arg Tyr Ala Tyr 

52S 530 535 

aag taa atgtaaaaca ttaaatctcc cagcttgcgt gagttttatt aaatattttg 1694 
Lys 

agatatccaa aaaaaaaaaa aaaaa 1719 
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<210> 2 
<211> 537 
<212> PRT 

<213> Arabidopsis thaliana 



<400> 2 



Met 


Glu 


Leu 


Ser 


Leu 


Leu 


Arg 


Pro Thr Thr 


Gin Ser 


Leu 


Leu Pro 


Ser 


1 








5 






10 






15 




Phe 


Ser Lys 


Pro 


Asn 


Leu 


Arg 


Leu Asn Val 


Tyr Lys 


Pro 


Leu Arg 


Leu 








20 








25 






30 




Arg 


Cys 


Ser 


Val 


Ala Gly 


Gly 


Pro Thr Val 


Gly Ser 


Ser 


Lys He 


Glu 






35 










40 




45 






Gly 


Gly Gly 


Gly Thr Thr 


He 


Thr Thr Asp Cys Val 


He 


Val Gly Gly 




50 










55 




60 








Gly 


lie 


Ser 


Gly Leu Cys 


He 


Ala Gin Ala 


Leu Ala 


Thr 


Lys His 


Pro 


65 










70 






75 






80 


Asp 


Ala 


Ala 


Pro 


Asn 


Leu 


He 


Val Thr Glu 


Ala Lys 


Asp Arg Val 


Gly 










85 






90 






95 




Gly 


Asn 


lie 


He 


Thr Arg 


Glu 


Glu Asn Gly 


Phe Leu 


Trp 


Glu Glu 


Gly 








100 








105 






110 




Pro 


Asn 


Ser 


Phe 


Gin' 


Pro 


Ser 


Asp Pro Met 


Leu Thr 


Met 


Val Val Asp 






115 










120 




125 






Ser 


Gly Leu 


Lys 


Asp Asp 


Leu 


Val Leu Gly Asp Pro 


Thr 


Ala Pro 


Arg 




130 










135 




140 








<Ph(e4val- 


•'L-eu--^Trp:.-Asn Gly Lys-^Leu- Arg Pro 


•Val Pro 


Ser 


Lys Leu 


Thr 


.^145 










*K1:50 






155 






160 


Asp 


Leu- 


-Pro'^Phe' 


-Phe: 


Asp 


Leu 


Met Ser He 


Gly Gly 


Lys 


He Arg Ala 










^165 






170 






175 




:-Gly'-Phe Gly-'Alai-Deu^Gly- Id e* -Arg Pro Ser 


Pro Pro 


Gly Arg Glu Glu 






^H^80 








185 






190 




^ Ser***Val- 


Glu 


-Glu 


^Phe' Val 


Arg- 


-Arg Asn Leu 


Gly Asp 


Glu 


Val Phe 


Glu 






U95 










200 




205 






Ar g >Iieu 1 1 e .^Glu'-^Pr o^^Phe Gy s > - S e r G ly Val 


Tyr Ala 


Gly Asp Pro 


Ser 




-21-0 










215 




220 








M'LysvEeU' 


J Ser- i Met 


. Lys -Ala Ala^'Phe Gly Lys Val Trp 


Lys 


Leu Glu 


Gin 


•n?2!25 










230 






235 






240 


-v^iAsnV^Gly 


Gly ^Ser 


-Ide 


lie Gly Gly Thr Phe 


Lys Ala 


He : Gin Glu 


Arg 










245 






'250 






255 




LyS;^^Asn 


Ala 


Pro 


Lys 


Ala 


Glu 


Arg Asp Pro 


Arg Leu 


Pro 


Lys Pro 


Gin 








260 








265 






270 




Gly 


Gin 


Thr 


Val 


Gly 


Ser 


Phe 


Arg Lys Gly Leu Arg 


Met 


Leu Pro 


Glu 






275 










280 




285 






Ala 


lie 


Ser 


Ala 


Arg 


Leu Gly 


Ser Lys Val 


Lys Leu 


Ser 


Trp Lys 


Leu 




290 










295 




300 






Ser 


Gly 


lie 


Thr 


Lys 


Leu 


Glu 


Ser Gly Gly Tyr Asn 


Leu 


Thr Tyr Glu 


305 










310 






315 






320 


Thr 


Pro 


Asp Gly 


Leu 


Val 


Ser 


Val Gin Ser 


Lys Ser 


Val 


Val Met 


Thr 










325 






330 






335 




Val 


Pro 


Ser 


His 


Val 


Ala 


Ser Gly Leu Leu Arg Pro 


Leu 


Ser Glu 


Ser 








340 








345 






350 




• Ala 


Ala 


Asn 


Ala 


Leu 


Ser 


Lys 


Leu Tyr Tyr 


Pro Pro 


Val 


Ala Ala 


Val 






355 










360 




365 






Ser 


lie 


Ser 


Tyr 


Pro 


Lys 


Glu 


Ala He Arg 


Thr Glu 


Cys 


Leu He 


Asp 




370 










375 




3 80 






Gly 


Glu 


Leu 


Lys 


Gly 


Phe Gly Gin Leu His 


Pro Arg 


Thr 


Gin Gly Val 



385 390 395 400 
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Glu 


Thr 


Leu Gly Thr 


He 


Tyr 


Ser 


Ser 


Ser 


Leu 


Phe 


Pro 


Asn 


Arg 


Ala 








405 










410 










415 




Pro 


Pro 


Gly Arg 
420 


He 


Leu 


Leu 


Leu 


Asn 
425 


Tyr 


He 


Gly 


Gly 


Ser 
430 


Thr 


Asn 


Thr 


Gly 


He Leu 


Ser 


Lys 


Ser 


Glu 


Gly 


Glu 


Leu 


Val 


Glu 


Ala 


Val 


Asp 






435 








440 










445 






Arg 


Asp 
450 


Leu Arg 


Lys 


Met 


Leu 
455 


He 


Lys 


Pro 


Asn 


Ser 
460 


Thr 


Asp 


Pro 


Leu 


Lys 


Leu 


Gly Val 


Arg Val 


Trp 


Pro 


Gin 


Ala 


He 


Pro 


Gin 


Phe 


Leu 


Val 


465 








470 










475 










480 


Gly 


His 


Phe Asp 


He 
485 


Leu 


Asp 


Thr 


Ala 


Lys 
490 


Ser 


Ser 


Leu 


Thr 


Ser 
495 


Ser 


Gly 


Tyr 


Glu Gly Leu 


Phe 


Leu 


Gly 


Gly 


Asn 


Tyr 


Val 


Ala 


Gly 


Val 


Ala 






500 










505 










510 






Leu 


Gly 


Arg Cys 
515 


Val 


Glu 


Gly 


Ala 
520 


Tyr 


Glu 


Thr 


Ala 


He 
525 


Glu 


Val 


Asn 


Asn 


Phe 
530 


Met Ser 


Arg 


Tyr 


Ala 
535 


Tyr 


Lys 

















<210> 3 
<211> 1738 
<212> DNA 

<213> Arabidopsis thaliana 

<220> 

<221> CDS 

<222> (70) . . (1596) 

<223> Arabidopsis protox-2 

<400> 3 

ttttttactt atttccgtca ctgctttcga ctggtcagag attttgactc tgaattgttg 60 

cagatagca atg gcg tct gga gca gta gca gat cat caa att gaa gcg gtt 111 
Met Ala Ser Gly Ala Val Ala Asp His Gin He Glu Ala Val 
15 10 

tea gga aaa aga gtc gca gtc gta ggt gca ggt gta agt gga ctt gcg 159 
•Ser Gly Lys Arg Val Ala Val Val Gly Ala Gly Val Ser Gly Leu Ala 
15 20 25 30 

gcg get tac aag ttg aaa teg agg ggt ttg aat gtg act gtg ttt gaa 207 
Ala Ala Tyr Lys Leu Lys Ser Arg Gly Leu Asn Val Thr Val Phe Glu 

35 40 45 

get gat gga aga gta ggt ggg aag ttg aga agt gtt atg caa aat ggt 255 
Ala Asp Gly Arg Val Gly Gly Lys Leu Arg Ser Val Met Gin Asn Gly 

50 55 60 

ttg att tgg gat gaa gga gea aac ace atg act gag get gag cca gaa 303 
Leu He Trp Asp Glu Gly Ala Asn Thr Met Thr Glu Ala Glu Pro Glu 
65 70 75 

gtt ggg agt tta ctt gat gat ett ggg ctt cgt gag aaa caa caa ttt 351 
Val Gly Ser Leu Leu Asp Asp Leu Gly Leu Arg Glu Lys Gin Gin Phe 
80 85 90 
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cca att tea cag aaa aag egg tat att gtg egg aat ggt gta cat gtg 399 

Pro lie Ser Gin Lys Lys Arg Tyr lie Val Arg Asn Gly Val Pro Val 
95 100 105 110 

atg eta cet acc aat cee ata gag ctg gte aea agt agt gtg etc tet 447 

Met Leu Pro Thr Asn Pro lie Glu Leu Val Thr Ser Ser Val Leu Ser 

115 120 125 

acc eaa tet aag ttt caa ate ttg ttg gaa eea ttt tta tgg aag aaa 495 

Thr Gin Ser Lys Phe Gin lie Leu Leu Glu Pro Phe Leu Trp Lys Lys 

130 135 140 

aag tec tea aaa gte tea gat gca tet get gaa gaa agt gta age gag 543 

Lys Ser Ser Lys Val Ser Asp Ala Ser Ala Glu Glu Ser Val Ser Glu 
145 150 155 

ttc ttt caa cgc cat ttt gga caa gag gtt gtt gac tat etc ate gac 591 

Phe Phe Gin Arg His Phe Gly Gin Glu Val Val Asp Tyr Leu lie Asp 

160 165 170 

cct '^ttt, gtt ggt gga aea agt get gcg gac ect gat tec ett tea atg 639 

Pro Phe Val Gly Gly Thr Ser Ala Ala Asp Pro Asp Ser Leu Ser Met 
i75 180 185 190 

"^aacft^^cat'-tctnttc^-eca^gat etc tgg :aa^^ aaa agt ttt ggc^tet 687 

''^^ys>M±sf?SBT^^^^ Glu V Lys Ser Phe Gly Ser^ 

U95 .200- . 205 

-attrat^ gtc.ijggt"t.gca^ ate- aga aea- aag ttt- get- get aaa ggt ggt aaa 735 
^^^^^BjM^l'^^^k^^^ Lys Gly Gly Lys 

' ''-210 -215 220 

H;agt^iaga^igacvaca-*mag. :agt tet eet .gge aea aaa, aag- ggt teg cgt-ggg 783 

"-^^Seru^Arg^^^sp^ThrAys /Ser Ser -ProvGly Thr Lys Lys* Gly Ser Arg Gly 
^2i25 230. ^ 235 

'^:':fec^ettditet.rttt. aag^t^g^ att ctt cet gat aeg ttg tge 831 

^Serit^lPhe.TSerrPhe Lys .Gly Gly Met Gin lie Leu Pro Asp Thr Leu Cys 

Cf^40 245 250 

aaa agt etc tea cat gat gag ate aat tta gac tee aag gta etc tet 879 

Lys Ser Leu Ser His Asp Glu lie Asn Leu Asp Ser Lys Val Leu Ser 
255 260 265 270 

ttg tet tac aat tet gga tea aga cag gag aac tgg tea tta tet tgt 927 

Leu Ser Tyr Asn Ser Gly Ser Arg Gin Glu Asn Trp Ser Leu Ser Cys 

275 280 285 

gtt teg eat aat gaa aeg cag aga caa aac ccc eat tat gat get gta 975 

Val Ser His Asn Glu Thr Gin Arg Gin Asn Pro His Tyr Asp Ala Val 

290 295 ' 300 

att atg aeg get cet etg tge aat gtg aag gag atg aag gtt atg aaa 1023 

He Met Thr Ala Pro Leu Cys Asn Val Lys Glu Met -Lys Val Met Lys 
305 310 315 
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gga gga caa ccc ttt cag eta aac ttt etc ccc gag att aat tac atg 1071 
Gly Gly Gin Pro Phe Gin Leu Asn Phe Leu Pro Glu lie Asn Tyr Met 
320 325 330 

ccc etc teg gtt tta ate aec aca tte aea aag gag aaa gta aag aga 1119 
Pro Leu Ser Val Leu lie Thr Thr Phe Thr Lys Glu Lys Val Lys Arg 
335 340 345 350 

cct ctt gaa ggc ttt ggg gta etc att cca tct aag gag caa aag cat 1167 
Pro Leu Glu Gly Phe Gly Val Leu lie Pro Ser Lys Glu Gin Lys His 

355 360 365 

ggt ttc aaa act eta ggt aca ett ttt tea tea atg atg ttt eea gat 1215 
Gly Phe Lys Thr Leu Gly Thr Leu Phe Ser Ser Met Met Phe Pro Asp 

370 375 380 

cgt tec cet agt gac gtt cat eta tat aea act ttt att ggt ggg agt 1263 
Arg Ser Pro Ser Asp Val His Leu Tyr Thr Thr Phe lie Gly Gly Ser 
385 390 395 

agg aae cag gaa eta gee aaa get tee act gac gaa tta aaa eaa gtt 1311 
Arg Asn Gin Glu Leu Ala Lys Ala Ser Thr Asp Glu Leu Lys Gin Val 
400 405 410 

gtg act tet gac ctt cag cga ctg ttg ggg gtt gaa ggt gaa ccc gtg 1359 
Val Thr Ser Asp Leu Gin Arg Leu Leu Gly Val Glu Gly Glu Pro Val 
415 . 420 425 430 

tct gtc aae eat tac tat tgg agg aaa gca ttc ecg ttg tat gac age 1407 
Ser Val Asn His Tyr Tyr Trp Arg Lys Ala Phe Pro Leu Tyr Asp Ser 

435 440 445 

age tat gac tea gte atg gaa gca att gac aag atg gag aat gat eta 1455 
Ser Tyr Asp Ser Val Met Glu Ala lie Asp Lys Met Glu Asn Asp Leu 

450 455 460 

cct ggg ttc ttc tat gca ggt aat cat ega ggg ggg etc tct gtt ggg 1503 
Pro Gly Phe Phe Tyr Ala Gly Asn His Arg Gly Gly Leu Ser Val Gly 
465 470 475 

aaa tea ata gca tea ggt tge aaa gca get gac ett gtg ate tea tac 1551 
Lys Ser lie Ala Ser Gly Cys Lys Ala Ala Asp Leu Val lie Ser Tyr 
480 485 490 

ctg gag tct tgc tea aat gac aag aaa eea aat gac age tta taa 1596 
Leu Glu Ser Cys Ser Asn Asp Lys Lys Pro Asn Asp Ser Leu 
495 500 505 

cattgtcaag gttegtcect ttttateaet tactttgtaa aettgtaaaa tgcaacaage 1656 

egccgtgega ttagceaaca acteageaaa acceagattc tcataagget caetaattcc 1716 

agaataaact atttatgtaa aa 1738 



<210> 4 
<211> 508 
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<212> PRT 

<213> Arabidopsis tha liana 



<400> 4 



Met 


Ala 


Ser 


Gly Ala Val Ala Asp His 


Gin 


He 


Glu 


Ala 


Val 


Ser 


Gly 


1 






5 






10 










15 




Lys 


Arg 


Val 


Ala Val Val Gly Ala Gly 


Val 


Ser Gly 


Leu 


Ala 


Ala 


Ala 








20 




25 










30 






Tyr 


Lys 


Leu 


Lys Ser Arg Gly Leu Asn 


Val 


Thr 


Val 


Phe 


Glu Ala Asp 






35 




40 










45 








Gly 


Arg 


Val 


Gly Gly Lys Leu Arg 


Ser 


Val 


Met 


Gin 


Asn Gly Leu 


He 




50 




55 










60 










Trp 


Asp 


Glu 


Gly Ala Asn Thr Met 


Thr 


Glu 


Ala 


Glu 


Pro 


Glu 


Val 


Gly 


65 






70 








75 










80 


Ser 


Leu 


Leu 


Asp Asp Leu Gly Leu Arg 


Glu 


Lys 


Gin 


Gin 


Phe 


Pro 


He 








85 






90 










95 




Sex 


Gin 


Lys 


Lys Arg Tyr He 


Val 


Arg 


Asn Gly Val 


Pro 


Val 


Met 


Leu 








100 




105 










110 






Pro 


Thr 


Asn 
115 


Pro He Glu Leu 


Val 
120 


Thr 


Ser 


Ser 


Val 


Leu 
125 


Ser 


Thr 


Gin 


'^Ser 


Lys 
130 


Phe 


Gin He Leu Leu 

135 


Glu 


Pro 


Phe 


Leu 


Trp 
140 


Lys 


Lys 


Lys 


Ser 


"-Ser 


Lys- 


Val 


Ser Asp Ala Ser 


Ala 


Glu 


Glu 


Ser 


Val 


Ser 


Glu 


Phe 


Phe 


145 






l^SO 








155 










160 


^Gln-^Arg- 


His 


Phe Gly Gln;^Glu *Val Val 


Asp 


Tyr 


Leu 


He rAsp Pro Phe 








^- ^ ^165 






170 










175 






-fGly4 


^GtLy^ 


■iThr>Serv 'Ala^;Ala:i?eAsp Pro 


Asp 


Ser 


Leu 


Ser 


Met 


Lys 


His 








180 




185 










190 






-viSer<vphe"-^Pro^>Asp Leu' TrpvAsn^xVal 


Glu 


Lys 


Ser 


Phe 


Gly Ser He 


He 






>t95 


"*-200 










205 








-•^yali^^ly.^ 


^A-l-a" 


^aei>Argi^Thr- -Lys^^OPhe/ Ala 


Ala 


• Lys 


Gly 


Gly^ Lys 'Ser Arg 




C210 




215 










220 










'^^AspfeThr, 'Lys 


■Ser Ser Pro .Gly * Thr Lys 


Lys 


Gly 


Ser 


Arg Gly Ser 


Phe 


.^f225 






- -230 • 








235 










'240 


*i^Ser- 


'^'Phe Lys -Gly Gly Met Gin He Leu 


Pro Asp 


Thr 


Leu 


Cys 


Lys 


Ser 








■245 






250 










•^255 




^iJtfLeu »Seri5His ^Asp-'Glu lie.. Asn Leu Asp 


Ser 


Lys 


Val 


Leu 


Ser 


Leu 


Ser 








260 




265 










270 






♦-Tyr- Asn 


Ser Gly Ser Arg .Gin 


Glu 


Asn 


Trp 


Ser 


Leu 


Ser 


Cys. 


Val 


Ser 






275 




280 










285 








His 


Asn 
290 


Glu 


Thr Gin Arg Gin 

295 


Asn 


Pro 


His 


Tyr 


Asp 
300 


Ala 


Val 


He 


Met 


Thr 


Ala 


Pro 


Leu Cys Asn Val 


Lys 


Glu 


Met 


Lys 


Val 


Met 


Lys 


Gly Gly 


305 






310 








315 










320 


Gin 


Pro 


Phe 


Gin Leu Asn Phe 


Leu 


Pro 


Glu 


He 


Asn 


Tyr Met 


Pro 


Leu 








325 






330 










335 




Ser 


Val 


Leu 


He Thr Thr Phe 
340 


Thr 


Lys 
345 


Glu 


Lys 


Val 


Lys 


Arg 
350 


Pro 


Leu 


Glu Gly 


Phe Gly Val Leu He 


Pro 


Ser 


Lys 


Glu 


Gin 


Lys 


His 


Gly 


Phe 






355 




360 










365 








Lys 


Thr 


Leu Gly Thr Leu Phe 


Ser 


Ser 


Met 


Met 


Phe 


Pro 


Asp 


Arg 


Ser 




370 




375 










380 










Pro 


Ser 


Asp Val ms Leu Tyr 


Thr 


Thr 


Phe 


He Gly 


Gly 


Ser 


Arg Asn 


385 






390 








395 










400 


Gin 


Glu 


Leu 


Ala Lys Ala Ser Thr Asp 


Glu 


Leu 


Lys 


Gin 


Val 


Val 


Thr 








405 






410 










415 




Ser 


Asp 


Leu Gin Arg Leu Leu Gly Val 


Glu Gly Glu 


Pro 


Val 


Ser 


Val 
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420 425 430 



Asn 


His 


Tyr 
435 


Tyr 


Trp 


Arg 


Lys 


Ala 
440 


Phe 


Pro 


Leu 


Tyr Asp 
445 


Ser 


Ser 


Tyr 


Asp 


Ser 


Val 


Met 


Glu 


Ala 


lie 


Asp 


Lys 


Met 


Glu 


Asn Asp 


Leu 


Pro Gly 


450 










455 










460 








Phe 


Phe 


Tyr 


Ala 


Gly Asn His Arg Gly Gly Leu 


Ser Val Gly Lys 


Ser 


465 








470 










475 








480 


lie 


Ala 


Ser 


Gly 


Cys 
485 


Lys 


Ala 


Ala 


Asp 


Leu 
490 


Val 


lie Ser 


Tyr 


Leu 
495 


Glu 


Ser 


Cys 


Ser 


Asn 


Asp 


Lys 


Lys 


Pro 


Asn 


Asp 


Ser 


Leu 









500 505 



<210> 5 
<211> 1691 
<212> DNA 
<213> Zea mays 

<220> 

<221> CDS 

<222> (1) . . (1443) 

<223> maize protox-1 c-DNA{not full-length) 
<400> 5 

gcg gac tgc gtc gtg gtg ggc gga ggc ate agt ggc etc tgc acc gcg 48 
Ala Asp Cys Val Val Val Gly Gly Gly lie Ser Gly Leu Cys Thr Ala 
15 10 15 

cag gcg ctg gcc acg egg cac ggc gtc ggg gac gtg ctt gtc acg gag 96 
Gin Ala Leu Ala Thr Arg His Gly Val Gly Asp Val Leu Val Thr Glu 

20 25 30 

gcc cgc gcc cgc ccc ggc ggc aac att acc acc gtc gag cgc ccc gag 144 
Ala Arg Ala Arg Pro Gly Gly Asn lie Thr Thr Val Glu Arg Pro Glu 
35 40 45 

gaa ggg tac etc tgg gag gag ggt ccc aac age ttc cag ccc tec gac 192 
Glu Gly Tyr Leu Trp Glu Glu Gly Pro Asn Ser Phe Gin Pro Ser Asp 
' 50 55 60 

ccc gtt etc acc atg gee gtg gac age gga ctg aag gat gac ttg gtt 240 
Pro Val Leu Thr Met Ala Val Asp Ser Gly Leu Lys Asp Asp Leu Val 
65 70 75 80 

ttt ggg gac oca aac gcg ccg cgt ttc gtg ctg tgg gag ggg aag ctg 288 
Phe Gly Asp Pro Asn Ala Pro Arg Phe Val Leu Trp Glu Gly Lys Leu 

85 90 95 

agg ccc gtg cca tec aag ccc gcc gac etc ccg ttc ttc gat etc atg 336 
Arg Pro Val Pro Ser Lys Pro Ala Asp Leu Pro Phe Phe Asp Leu Met 

100 105 110 

age ate cca ggg aag etc agg gcc ggt eta ggc gcg ctt ggc ate cgc 384 
Ser He Pro Gly Lys Leu Arg Ala Gly Leu Gly Ala Leu Gly He Arg 
115 120 125 
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ccg cct cct cca ggc cgc gaa gag tea gtg gag gag ttc gtg cgc cgc 432 
Pro Pro Pro Pro Gly Arg Glu Glu Ser Val Glu Glu Phe Val Arg Arg 
130 135 140 

aac etc ggt get gag gtc ttt gag egc etc att gag cct ttc tge tea 480 
Asn Leu Gly Ala Glu Val Phe Glu Arg Leu lie Glu Pro Phe Cys Ser 
145 150 155 160 

ggt gtc tat get ggt gat cet tct aag etc age atg aag get gea ttt 528 
Gly Val Tyr Ala Gly Asp Pro Ser Lys Leu Ser* Met Lys Ala Ala Phe 

165 170 175 

ggg aag gtt tgg egg ttg gaa gaa act gga ggt agt att att ggt gga 576 
Gly Lys Val Trp Arg Leu Glu Glu Thr Gly Gly Ser lie lie Gly Gly 

180 185 190 

ace ate aag aca att cag gag agg age aag aat cca aaa cca ccg agg 624 
Thr lie Lys Thr lie Gin Glu Arg Ser Lys Asn Pro Lys Pro Pro Arg 
195 200 205 

-vgat'-gee egc ctt ccg aag cea aaa ggg eag aca gtt gea tct ttc agg 672 
-Asp. Ala -Arg Leu Pro- Lys- . Pro Lys Gly Gin Thr Val Ala Ser Phe Arg 
'210 215 220 

aag^i^ggt r'ctt gee atg-ictt cca aat gee att ' aca tec age ttg ggt agt 720 
**rLys^Gly.'*Leuv A{Lar^Met.ft'Leu>Pro^Asnr Aia lie- Thr Ser Ser Leu Gly Ser 
^'2'25 .:230 235 240 

-aaat^gtc'';*aaa = eta vtea tgg:^;aaa cte-^acg age att aca aaa tea gat gac 768 
, Ly s'Wal ■ Lys.. Leu ^S e r Trp f^Ly s .-Leu' Thr Se r 1 1 e Thr Lys Ser' Asp Asp 

'^245 250 . ,255 

jK'^aag^gga ,tat« gtt-ttgv?gag^ttat^;gaa aeg cca gaa ggg gtt gtt teg gtg ' 816 
-?^Eys -Gly- Tyr* Val- Leu^Glu- Tyr. GluvThr Pro ^Glu Gly Val Val Ser Val 

260 -265 270 . 

A^ag.gct .aaa agt . -.gtt ato, atg act .att cca tea tat gtt get age aac' 864 
;^^Gln"Ala Lys Ser Val Ile^ Met Thr- lie Pro Ser Tyr Val Ala Ser Asn 
275 ..2,80 285 

att ttg egt cea ctt tea age gat get gea gat get eta tea aga ttc 912 
lie Leu Arg Pro Leu Ser Ser Asp Ala Ala Asp Ala Leu Ser Arg Phe 
290 295 300 

tat tat cca ccg gtt get get gta act gtt teg tat cca aag gaa gea 960 
Tyr Tyr Pro Pro Val Ala Ala Val Thr Val Ser Tyr Pro Lys Glu Ala 
305 310 315 320 

att aga aaa gaa tgc tta att gat ggg gaa etc eag gge ttt ggc cag 1008 
lie Arg Lys Glu Cys Leu He Asp Gly Glu Leu Gin Gly Phe Gly Gin 

325 330 335 

ttg eat cca egt agt caa gga gtt gag aca tta gga aca ata tae agt 1056 
Leu His Pro Arg Ser Gin Gly Val Glu Thr Leu Gly Thr He Tyr Ser 

340 345 350 

tec tea etc ttt cca aat egt get cct gac ggt agg gtg tta ctt eta 1104 
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Ser Ser Leu Phe Pro Asn Arg Ala Pro Asp Gly Arg Val Leu Leu Leu 
355 360 365 

aac tac ata gga ggt get aca aac aca gga att gtt tec aag act gaa 
Asn Tvr He Gly Gly Ala Thr Asn Thr Gly He Val Ser Lys Thr Glu 



375 380 



agt gag ctg gtc gaa gca gtt gac cgt gac etc cga aaa atg ctt ata 
Ser Glu Leu Val Glu Ala Val Asp Arg Asp Leu Arg Lys Met Leu lie 

395 400 



385 390 



1152 



1200 



1248 



1296 



aat tct aca gca gtg gac cct tta gtc ctt ggt gtt cga gtt tgg cca 
Asn ser Thr Ala Val Asp Pro Leu Val Leu Gly Val Arg Val Trp Pro 

405 410 415 

caa gcc ata cct cag ttc ctg gta gga cat ctt gat ctt ctg gaa gcc 
Gin Ala lie Pro Gin Phe Leu Val Gly His Leu Asp Leu Leu Glu Ala 

420 425 430 

gca aaa get gcc ctg gac cga ggt ggc tac gat ggg ctg ttc eta gga 1344 
Ala Lys Ala Ala Leu Asp Arg Gly Gly Tyr Asp Gly Leu Phe Leu Gly 
435 440 445 



ggg aac tat gtt gca gga gtt gcc ctg ggc aga tgc gtt gag ggc gcg 1392 
Gly Asn Tyr Val Ala Gly Val Ala Leu Gly Arg Cys Val Glu Gly Ala 
450 455 460 

tat gaa agt gcc teg caa ata tct gac ttc ttg ace aag tat gcc tac 1440 
Tvr Glu Ser Ala Ser Gin He Ser Asp Phe Leu Thr Lys Tyr Ala Tyr 
465 470 475 480 

aag tgatgaaaga agtggagcgc tacttgttaa tcgtttatgt tgcatagatg 1493 
Lys 

aggtgcctcc ggggaaaaaa aagcttgaat agtatttttt attcttattt tgtaaattgc 1553 
atttctgttc ttttttctat cagtaattag ttatatttta gttctgtagg agattgttct 1613 
gttcactgcc cttcaaaaga aattttattt ttcattcttt tatgagagct gtgctactta 1673 
aaaaaaaaaa aaaaaaaa 1691 



<210> 6 
<211> 481 
<212> PRT 
<213> Zea mays 

<400> 6 ^ -1 

Ala Asp Cys Val Val Val Gly Gly Gly He Ser Gly Leu Cys Thr Ala 
15 10 15 

Gin Ala Leu Ala Thr Arg His Gly Val Gly Asp Val Leu Val Thr Glu 

20 25 30 

Ala Arg Ala Arg Pro Gly Gly Asn He Thr Thr Val Glu Arg Pro Glu 
35 40 45 
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Glu Gly Tyr Leu Trp Glu Glu Gly Pro Asn Ser Phe Gin Pro Ser Asp 
50 55 60 

Pro Val Leu Thr Met Ala Val Asp Ser Gly Leu Lys Asp Asp Leu Val 
65 70 75 80 

Phe Gly Asp Pro Asn Ala Pro Arg Phe Val Leu Trp Glu Gly Lys Leu 

85 90 95 

Arg Pro Val Pro Ser Lys Pro Ala Asp Leu Pro Phe Phe Asp Leu Met 

100 105 110 

Ser lie Pro Gly Lys Leu Arg Ala Gly Leu Gly Ala Leu Gly lie Arg 
115 120 125 

Pro Pro Pro Pro Gly Arg Glu Glu Ser Val Glu Glu Phe Val Arg Arg 
130 135 140 

Asn Leu Gly Ala Glu Val Phe Glu Arg Leu lie Glu Pro Phe Cys Ser 
•145 150 155 160 

Giy^Val Tyr Ala- Gly Asp Pro Ser Lys Leu Ser Met Lys Ala Ala Phe 

165 170 175 

^Gly^-Lys^ValH^.Trp'^fArg>^Leu-Glu^•Gl^ He lie Gly Gly 

VV1!80 185 190 

«Thr Ile^Lysv^Thr* Ile-Gln'i^Glu^ Arg Ser ■ Lys Asn Pro Lys' Pro Pro Arg 
*"195 -200 • 205 

-Asp. Ala-Arg-rLeu' Pro.Lys:-ProMLys Gly Gin Thr Val Ala Ser Phe Arg 

^^210 :-2:iS 220 

^Lys-^Gly-Leu- Ala**-Met ^Leu Pro-Asn Alai lie Thr Ser Ser Leu Gly Ser 
'-■225 .230 235 240 

-"LysVyal. Lys -Leu Ser Trp Lys Leu Thr Ser He Thr Lys* Ser Asp Asp 

"245 250 255 

Lys Gly Tyr Val Leu Glu Tyr Glu Thr Pro Glu Gly Val Val Ser Val 

260 265 270 

Gin Ala Lys Ser Val He Met Thr He Pro Ser Tyr Val Ala Ser Asn 
275 280 285 

He Leu Arg Pro Leu Ser Ser Asp Ala Ala Asp Ala Leu Ser Arg Phe 
290 295 300 

Tyr Tyr Pro Pro Val Ala Ala Val Thr Val Ser Tyr Pro Lys Glu Ala 
305 310 315 320 

He Arg Lys Glu Cys Leu He Asp Gly Glu Leu Gin Gly Phe Gly Gin 

325 330 335 

Leu His Pro Arg Ser Gin Gly Val Glu Thr Leu Gly Thr He Tyr Ser 

340 345 350 
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ser ser Leu Phe Pro Asn Arg Ala Pro Asp Gly Arg Val Leu Leu Leu 
355 360 365 

Asn Tyr He Gly Gly Ala Thr Asn Thr Gly He Val Ser Lys Thr Glu 
370 375 380 

Ser Glu Leu Val Glu Ala Val Asp Arg Asp Leu Arg Lys Met Leu He 
385 390 395 400 

Asn Ser Thr Ala Val Asp Pro Leu Val Leu Gly Val Arg Val Trp Pro 

405 410 415 

Gin Ala He Pro Gin Phe Leu Val Gly His Leu Asp Leu Leu Glu Ala 

420 425 430 

Ala Lys Ala Ala Leu Asp Arg Gly Gly Tyr Asp Gly Leu Phe Leu Gly 
435 440 445 

Gly Asn Tyr Val Ala Gly Val Ala Leu Gly Arg Cys Val Glu Gly Ala 
450 455 460 

Tvr Glu Ser Ala Ser Gin He Ser Asp Phe Leu Thr Lys Tyr Ala Tyr 
465 ' 470 475 " 480 



Lys 



<210> 7 
<211> 2061 
<212> DNA 
<213> Zea mays 

<220> 

<221> CDS 

<222> (64) . . (1698) 

<223> maize protox-2 



<400> 7 4. 4. cn 

ctctcctacc tccacctcca cgacaacaag caaatcccca tccagttcca aaccctaact bu 



caa atg etc get ttg act gcc tea gee tea tec get teg tec cat cct 
Met Leu Ala Leu Thr Ala Ser Ala Ser Ser Ala Ser Ser His Pro 
1 5 10 15 

tat cge cac gcc tee gcg cac act cgt cgc ccc cgc eta cgt gcg gtc 
Tvr Arg His Ala Ser Ala His Thr Arg Arg Pro Arg Leu Arg Ala Val 

20 25 30 

etc gcg atg gcg ggc tec gac gac ccc cgt gca gcg ccc gcc aga teg 
Leu Ala Met Ala Gly Ser Asp Asp Pro Arg Ala Ala Pro Ala Arg Ser 

35 40 45 



108 



156 



204 



gtc gcc gtc gtc ggc gee ggg gtc age ggg etc gcg gcg gcg tac agg 252 
Val Ala Val Val Gly Ala Gly Val Ser Gly Leu Ala Ala Ala Tyr Arg 
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50 55 60 

etc aga cag age ggc gtg aac gta aeg gtg ttc gaa gcg gee gac agg 300 

Leu Arg Gin Ser Gly Val Asn Val Thr Val Phe Glu Ala Ala Asp Arg 
65 70 75 

gcg gga gga aag ata egg acc aat tec gag gge ggg ttt gtc tgg gat 348 

Ala Gly Gly Lys lie Arg Thr Asn Ser Glu Gly Gly Phe Val Trp Asp 
80 85 90 95 

gaa gga get aae acc atg aca gaa ggt gaa tgg gag gee agt aga ctg 396 

Glu Gly Ala Asn Thr Met Thr Glu Gly Glu Trp Glu Ala Ser Arg Leu 

100 105 110 

att gat gat ctt ggt eta caa gac aaa cag cag tat ect aac tec caa 444 

lie Asp Asp Leu Gly Leu Gin Asp Lys Gin Gin Tyr Pro Asn Ser Gin 

115 120 125 

cac aag cgt tac att gtc aaa gat gga gca cca gca ctg att ect teg 492 

His Lys Arg Tyr lie Val Lys Asp Gly Ala Pro Ala Leu lie Pro Ser 
130 135' 140 

i,'gat^cce -att teg eta atg aaa age -agt gtt ctt teg aca aaa tea aag 540 

Asp'^Pro 'lie -Ser Leu. Met Lys Ser Ser Val Leu Ser Thr Lys Ser Lys 
'145 150 155 

.?>y^att^^geg^t^ta.<Kt^tt^^ tac aag- aaa, get aac aca aga 588 

• He 'Ala LeurVrPhe.-^Phe* Glu Pro Phe Leu Tyr Lys Lys Ala Asn Thr Arg 
■s^l-60 ^^165 . 170 175 

i^aae^tTct^gga^ftaaaiipgtg ;tct gag f^^^ ttg agt gag agt gtt ggg age 636 

.v Asn•*■^^Ser•^Gly-^Lys rVal'.Ser^ Glu- GlU 'His Leu Ser Glu Ser Val Gly Ser 

•180 185 190 

' ttciitgt- gaa ege cae ttt gga' aga * gaa gtt gtt gae tat ttt gtt gat 684 

;^Phe^>eys'^Giu' Arg^fHis^'-'P^ Arg- Glu Val Val Asp Tyr Phe Val Asp 

'195 -200 205 

cca ;ttt gta. get ;gga aca agt gca gga gat eca gjag tea eta tet att 732 

Pro 'Phe Val Ala Gly Thr Ser Ala Gly Asp Pro Glu Ser Leu Ser He 
210 215 220 

egt cat gca ttc eea gca ttg tgg aat ttg gaa aga aag tat ggt tea 780 

Arg His Ala Phe Pro Ala Leu Trp Asn Leu Glu Arg Lys Tyr Gly Ser 
225 230 235 

gtt att gtt ggt gee ate ttg tet aag eta gca get aaa ggt gat eea 828 

Val He Val Gly Ala He Leu Ser Lys Leu Ala Ala Lys Gly Asp Pro 
240 245 250 255 

gta aag aea aga eat gat tea tea ggg aaa aga agg aat aga cga gtg 876 

Val Lys Thr Arg His Asp Ser Ser Gly Lys Arg Arg Asn Arg Arg Val 

260 265 270 

teg ttt tea ttt eat ggt gga atg cag tea eta ata aat gea ctt eac 924 

Ser Phe Ser Phe His Gly Gly Met Gin Ser Leu He Asn Ala Leu His 

275 280 285 
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^ I- s s £ s s s s s ?s '£ fi r.; 

290 295 300 

ttg gca tgt aca ttt gat gga gtt cct gca eta ggc agg tgg tea att 1020 
^eu Ala cys Thr Phe Asp Gly Val Pro Ala Leu Gly Arg Trp Ser He 
305 310 315 



1068 



tct gtt gat teg aag gat age ggt gac aag gac ctt get agt aac caa 
ser val Lp Ser Lys Asp Ser Gly Asp Lys Asp Leu Ala Ser Asn Gin 
320 325 330 

.r-r- hfct aat act gtt ata atg aca get oca ttg tea aat gtc egg agg 1116 
Phe ASP S Va^ He Me? Ala Pro Leu Ser Asn Val Arg Arg 

340 345 ^=5" 

atg aag ttc acc aaa ggt gga get eeg gtt gtt ctt gae ttt ctt cct 1164 
Me? Lys Phe Ota Lys Gly Gly Ala Pro Val Val Leu Asp Phe Leu Pro 

355 360 365 

aag atg gat tat eta cca eta tct etc atg gtg act get ttt aag aag 1212 
lyl Me? ASP Tyr Leu Pro Leu Ser Leu Met Val Thr Ala Phe Lys Lys 
370 375 380 

aat aat gtc aag aaa cct ctg gaa gga ttt ggg gtc tta ata ect tac 1260 
ASP Sp .val Lys Lys Pro Leu Glu Gly Phe Gly Val Leu He Pro Tyr 
385 390 395 

aag gaa cag caa aaa eat ggt ctg aaa acc ctt ggg act etc ttt tec 1308 
Lys Glu Gin Gin Lys His Gly Leu Lys Ihr Leu Gly Thr Leu Phe Ser 
400 405 410 415 

tea atg atg ttc cca gat cga get cct gat gac caa tat tta tat aca 1356 
ser Me? Me? Phe Pro Asp Arg Ala Pro Asp Asp Gin Tyr Leu Tyr Thr 

420 425 430 

aca ttt gtt ggg ggt age cac aat aga gat ctt get gga get cca aeg 1404 
Thr Phe val Gly Gly Ser His Asn Arg Asp Leu Ala Gly Ala Pro Thr 

435 440 445 

tct att ctg aaa caa ctt gtg acc tct gac ctt aaa aaa etc ttg ggc 1452 
Ser He Leu Lys Gin Leu Val Thr Ser Asp Leu Lys Lys Leu Leu Gly 
450 455 460 

gta gag ggg caa cca act ttt gtc aag cat gta tac tgg gga aat get 1500 
val Glu Gly Gin Pro Thr Phe Val Lys His Val Tyr Trp Gly Asn Ala 
465 470 475 

ttt cct ttg tat ggc cat gat tat agt tct gta ttg gaa get ata gaa 1548 
Phe Pro Leu Tyr Gly His Asp Tyx Ser Ser Val Leu Glu Ala He Glu 
480 485 490 495 

aag atg gag aaa aac ctt cca ggg ttc ttc tac gca gga aat age aag 1596 
lys Me? GI? Lys Asn Leu Pro Gly Phe Phe Tyr Ala Gly Asn Ser Lys 

500 505 510 



15 



wo 01/68826 



PCT/EPOl/02753 



gat ggg ctt get gtt gga agt gtt ata get tea gga- age aag get get 1644 
Asp Gly Leu Ala Val Gly Ser Val lie Ala Ser Gly Ser Lys Ala Ala 

515 520 525 

gae ctt gca ate tea tat ett gaa tct cac ace aag cat aat aat tea 1692 
Asp Leu Ala lie Ser Tyr Leu Glu Ser His Thr Lys His Asn Asn Ser 
530 535 540 

eat tga aagtgtctga eetatcctet agcagttgtc gaeaaatttc tccagtteat 1748 
His 

545 

gtacagtaga aacegatgeg ttgcagttte agaaeatctt cacttettca gatattaacc 1808 
cttegttgaa cateeaecag aaaggtagtc acatgtgtaa gtgggaaaat gaggttaaaa 1868 
actattatgg cggeegaaat gttcettttt gttttcctca eaagtggcct acgacacttg 1928 
atgttggaaa tacatttaaa tttgttgaat tgtttgagaa cacatgegtg aegtgtaata 1988 
tttgcctatt gtgattttag cagtagtctt ggceagatta tgctttacgc ctttaaaaaa 2048 
aaaaaaaaaa aaa 2061 



-E^21''0>v 8 

•45:21'2^-^PRT 

^■,<2 13>j^sZeatt?iaays 

. ^t5c:400>^^8 

-fTt■Met>LeutAlavLeu^iThr^Ala^^ S Ser- Ala Ser Ser His -^Pro Tyr 

-.1 5 10 15 

^Arg*misyma^Ser^Ala^:Hds■>Thr^Arg Ar Arg Ala Val Leu 

-20 .-25 30 



vAaa*^Met'*^Ala^*Gly-^*Ser'>*AspiAsp" Pro Arg. Ala Ala Pro* Ala Arg Ser Val 







r35 




.;40 






45 








Ala^Val^^Val^ 


^Gly 


Ala Gly Val 'Ser -Gly- Leu Ala Ala 


Ala 


Tyr 


Arg- 


Leu 




■^50 






55 




60 










Arg^tGin. Ser 


Gly 


Val. Asn Val Thr 


Val Phe . Glu Ala 


Ala 


Asp 


Arg 


Ala 


65 








70 




75 




80 


Gly 


Gly 


Lys 


lie 


Arg Thr Asn Ser 


Glu Gly Gly Phe 


Val 


Trp 


Asp 


Glu 










85 


90 








95 




Gly 


Ala 


Asn 


Thr 


Met Thr Glu Gly 


Glu Trp 


Glu Ala 


Ser 


Arg 


Leu 


He 








100 




105 






110 






Asp 


Asp 


Leu 


Gly 


Leu Gin Asp Lys 


Gin Gin 


Tyr Pro 


Asn 


Ser 


Gin 


His 






115 




120 






125 








Lys 


Arg 


Tyr 


He 


Val Lys Asp Gly 


Ala Pro 


Ala Leu 


He 


Pro 


Ser 


Asp 




130 






135 




140 








Pro 


lie 


Ser 


Leu 


Met Lys Ser Ser 


Val Leu 


Ser Thr 


Lys 


Ser 


Lys 


He 


145 








150 


• 


155 






160 


Ala 


Leu 


Phe 


Phe 


Glu Pro Phe Leu 


Tyr Lys 


Lys Ala 


Asn 


Thr 


Arg 


Asn 










165 


170 








175 




Ser 


Gly 


Lys 


Val 


Ser Glu Glu His 


Leu Ser 


Glu Ser 


Val 


Gly 


Ser 


Phe 








180 




185 






190 






Cys 


Glu 


Arg 


His 


Phe Gly Arg Glu 


Val Val Asp Tyr 


Phe 


Val 


Asp 


Pro 






195 




200 






205 
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Phe Val Ala GlY Thr Ser Ala Gly Asp Pro Glu Ser Leu Ser He Arg 

220 

Lys 

Lys 

245 250 255 



His t3l Phe pro Ala Leu Trp Asn Leu Glu Arg Lys lYr Gly Ser Val 
725 230 235 

lie val Gly Ala He Leu Ser Lys Leu Ala Ala Lys Gly Asp Pro Val 



Lys Thr Arg His Asp Ser Ser Gly Lys Arg Arg Asn Arg Arg Val Ser 

Phe ser Phe His Gly Gly Met Gin Ser Leu He Asn Ala Leu His Asn 

275 280 285 

Glu val Gly ASP ASP Asn Val Lys Leu Gly Thr Glu Val Leu Ser Leu 

990 295 300 

Ala cys Thr Phe Asp Gly Val Pro Ala Leu Gly Arg Trp Ser He Ser 
305 310 315 

val ASP ser Lys Asp Ser Gly Asp Lys Asp Leu Ala Ser Asn Gin Thr 

Phe ASP Ala val lie Met Thr Ala Pro Leu Ser Asn Val Arg Arg Met 

340 345 
Lys Phe Thr Lys Gly Gly Ala Pro Val Val Leu Asp Phe Leu Pro Lys 

* 355 360 365 

Met Asp Tyr Leu Pro Leu Ser Leu Met Val Thr Ala Phe Lys Lys Asp 

370 375 380 

ASP val Lys Lys Pro Leu Glu Gly Phe Gly Val Leu He Pro Tyr Lys 
.ooir 390 395 

Glu Gin Gin Lys His Gly Leu Lys Thr Leu Gly Thr Leu Phe Ser Ser 

4Q5 410 *ki.D 

Met Met Phe Pro Asp Arg Ala Pro Asp Asp Gin Tyr Leu Tyr Thr Thr 

420 425 430 

Phe val Gly Gly Ser His Asn Arg Asp Leu Ala Gly Ala Pro Thr Ser 

435 440 445 

He Leu Lys Gin Leu Val Thr Ser Asp Leu Lys Lys Leu Leu Gly Val 

450 455 460 

Glu Gly Gin Pro Thr Phe Val Lys His Val Tyr Trp Gly Asn Ala Phe 
465 470 475 

Pro Leu Tyr Gly His Asp Tyr Ser Ser Val Leu Glu Ala He Glu Lys 

485 490 495 

Met Glu Lys Asn Leu Pro Gly Phe Phe Tyr Ala Gly Asn Ser Lys Asp 

500 505 510 

Gly Leu Ala Val Gly Ser Val He Ala Ser Gly Ser Lys Ala Ala Asp 

53^5 520 525 

Leu Ala He Ser Tyr Leu Glu Ser His Thr Lys His Asn Asn Ser His 
530 535 540 



<210> 9 
.<211> 1811 
<212> DNA 

<213> Triticum aestivum 

<220> 

<221> CDS 

<222> (3).. (1589) 

<223> wheat protox-1 



gj°gca\ca atg gcc acc gcc acc gtc gcg gcc gcg teg ccg etc egc 47 
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Ala Thr Met Ala Thr Ala Thr Val Ala Ala Ala Ser Pro Leu Arg 
15 10 15 

ggc agg gtc acc ggg cgc cca cac cgc gtc cgc ccg cgt tgc get acc 95 
Gly Arg Val Thr Gly Arg Pro His Arg Val Arg Pro Arg Cys Ala Thr 

20 25 30 

gcg age age gcg aec gag act eeg geg geg cee gge gtg egg ctg tee 143 
Ala Ser Ser Ala Thr Glu Thr Pro Ala Ala Pro Gly Val Arg Leu Ser 

35 40 45 

gcg gaa tge gtc att gtg ggc gcc ggc ate age gge etc tgc ace geg 191 
Ala Glu Cys Val He Val Gly Ala Gly He Ser Gly Leu Cys Thr Ala 
50 55 60 

eag gcg etg gcc acc cga tac gge gtc age gae ctg etc gtc acg gag 239 
Gin Ala Leu Ala Thr Arg Tyr Gly Val Ser Asp Leu Leu Val Thr Glu 
65 70 75 

gcc cge gae cgc ccg ggc ggc aac ate ace ace gtc gag cgt cee gae 287 
Ala Arg Asp Arg Pro Gly Gly Asn He Thr Thr Val Glu Arg Pro Asp 
80 85 90 95 

gag- ggg -tac ctg tgg gag gag.gga cee aac age tte cag ccc tec gae 335 
:-Glui(iGiy Tyr *Leu Tirp Glu Glu Gly Pro Asn Ser Phe Gin Pro Ser Asp 

'1^00 105 110 

'■^Qcgigtcvctc- acc^atg^^gcC' gtg gac age ggg etc aag gat gae ttg gtg 383 
*^r-Bro?*Val^Leu^iThr^>Met -Ala-A^al-^-^ Ser -Gly . Leu Lys Asp Asp Leu Val 

■51^15 120 125 

?rttcrsgggi«f'gac*.cec^^aac;!:geg..cec egg *ttc. gtg ctg^tgg gag -ggg aag etg 431 
^Phe-'^GlyvAsp '^Pro-* ,AsnwAla*irPro-.Arg Phe Val Leu Trp Glu Gly Lys Leu 
A mo 135 140 

<?agg:^'cGg^*ig;tgv:ecg- tcgsfaag cca^ggc .gae -'ctg cct ttc ttc -age etc atg 479 
^--Arg^rO'^Val : Pro-iSer-vLys yPro .Gly Asp Leu Pro Phe Phe Ser Leu Met 
'^W4S ISO 155 

agt..atc.cct ggg aag etc agg gcc ggc ctt gge geg etc ggc att cgc 527 
Ser He Pro Gly Lys Leu Arg Ala Gly Leu Gly Ala Leu Gly He Arg 
160 165 170 175 

cca cet cct cca ggg cgc gag gag teg gtg gag gag ttt gtg cgc ege 575 
Pro Pro Pro Pro Gly Arg Glu Glu Ser Val Glu Glu Phe Val Arg Arg 

180 185 190 

aac etc ggt gee gag gtc ttt gag cgc etc ate gag cct ttc tgc tea 623 
Asn Leu Gly Ala Glu Val Phe Glu Arg Leu He Glu Pro Phe Cys Ser 

195 200 205 

ggt gta tat get ggt gat cct teg aag ett agt atg aag get gca ttt 671 
Gly Val Tyr Ala Gly Asp Pro Ser Lys Leu Ser Met Lys Ala Ala Phe 
210 215 220 

ggg aag gtc tgg agg ttg gag gag att gga ggt agt att att ggt gga 719 
Gly Lys Val Trp Arg Leu Glu Glu He Gly Gly Ser He He Gly Gly 
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225 230 235 



acc ate aag gcg att cag gat aaa ggg aag aac ccc aaa ccg cca agg 
tS III LYS Ala lie Gin Asp Lys Gly Lys Asn Pro X,ys Pro Pro Arg 
240 245 250 255 



767 



nat- CCC caa ctt ccg gca cca aag gg<a i;aa a -a » — 

£p Pro Arg Leu Pro Ala Pro Lys Gly Gin Thr Val Ala Ser Phe Arg 

aaa aat eta gee atg cte ceg aat gui; ate gca tct agg ctg ggt 
Lys It 2u Ma Met Leu Pro Asn Ala He Ala Ser Arg Leu Gly Ser 

280 285 



863 



1007 



aat ccc cga ctt ccg gca cca aag gga cag acg gtg gca tct ttc agg 815 

Pro 
260 

aaa aat eta gee atg cte ceg aat gee ate gca tct agg etg ggt agt 
aag ggt a^*- '^a _ _ ,,_-rn_-«T^ c^v- avr. t.oh rIv Ser 

Ala 

275 

aaa ate aag ctg tea tgg aag ctt aeg age att aca aag gcg gac aac 911 

5s vTl Lys Leu Ser Trp Lys Leu Thr Ser He Thr Lys Ala Asp Asn 
290 295 300 

caa gga tat gta tta ggt tat gaa aca cca gaa gga ctt gtt tea gtg 959 
Sn lly Tyr Val Leu Gly Tyr Glu Thr Pro Glu Gly Leu Val Ser Val 
305 310 315 

eaa act aaa agt gtt ate atg acc ate ccg tea tat gtt get agt gat 
gS III lys slv ?al lie Met Thr He Pro Ser Tyr Val Ala Ser Asp 
320 325 330 

ate ttg egc cca ctt tea att gat gea gca gat gca cte tea aaa ttc 
lie Leu Arg Pro Leu Ser He Asp Ala Ala Asp Ala Leu Ser Lys Phe 

340 345 350 

tat tat ccg cca gtt get get gta act gtt tea tat cca aaa gaa get 
JJr Tyr Pro Pro Val Ala Ala Val Thr Val Ser Tyr Pro Lys Glu Ala 

355 360 365 

att aga aaa gaa tge tta att gat ggg gag etc cag ggt ttc ggc cag 1151 
He Arg Lys Glu Cys Leu He Asp Gly Glu Leu Gin Gly Phe Gly Gin 
370 375 380 

ttg eat cca cgt age caa gga gtc gag act tta ggg aca ata tat age 1199 
Su Tis Pro T^g ser Gin Gly Val Glu Thr Leu Gly Thr He Tyr Ser 
385 390 395 

tct tct etc ttt cct aat cgt get ect get gga aga gtg tta ctt ctg 
ser ser Leu Phe Pro Asn Arg Ala Pro Ala Gly Arg Val Leu Leu Leu 
400 405 410 415 



1055 



1103 



1247 



aac tat ate ggg ggt tct aca aat aca ggg ate gtc tec aag act gag 1295 
Asn Sr ?ie III Gly Ser Thr Asn Thr Gly He Val Ser Lys Thr Glu 

420 425 430 

agt gac tta gta gga gcc gtt gac cgt gac etc aga aaa atg ttg ata 1343 
ser ASP Leu Val Gly Ala Val Asp Arg Asp Leu Arg Lys Met Leu He 

435 440 445 

aac ect aga gca gca gac ect tta gea tta ggg gtt ega gtg tgg cca 1391 
Asn Pro Arg Ala Ala Asp Pro Leu Ala Leu Gly Val Arg Val Trp Pro 
450 455 460 
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caa gca ata cca cag ttt ttg att ggg cac ctt gat cgc ctt get get 1439 
Gin Ala lie Pro Gin Phe Leu lie Gly His Leu Asp Arg Leu Ala Ala 
465 470 475 

gca aaa tet gca ctg ggc caa ggc ggc tac gac ggg ttg ttc eta gga 1487 
Ala Lys Ser Ala Leu Gly Gin Gly Gly Tyr Asp Gly Leu Phe Leu Gly 
480 485 490 495 

gga aac tac gtc gca gga gtt gee ttg ggc cga tgc ate gag ggt gcg 1535 
Gly Asn Tyr Val Ala Gly Val Ala Leu Gly Arg Cys lie Glu Gly Ala 

500 505 510 

« 

tac gag agt gcc tea caa gta tet gac ttc ttg acc aag tat gee tac 1583 
Tyr Glu Ser Ala Ser Gin Val Ser Asp Phe Leu Thr Lys Tyr Ala Tyr 

515 520 525 

aag tga tggaagtagt gcatctctte attttgttgc atatacgagg tgaggctagg 1639 
Lys 

-atcggbaaaa catcatgaga ttctgtagtg tttetttaat tgaaaaaaca aattttagtg 1699 
.-atgcaatatg . tgc tet t tec tgtagttcga gcatgtacat cggtatggga taaagtagaa 1759 
-/ifeaagctattc .tgeaaaagca gtgatttttt ttgaaaaaaa aaaaaaaaaa aa 1811 



v:<210>^10 
.;*^:211>3528 
:<21'2>#PRT 

^-21^3>^Tri ticum«i ae'st-ivum 



-i-Aia^Thr*. Met s Ala. 


Thr- Ala. Thr' Val . Ala Ala 


Ala Ser. Pro 


Leu Arg Gly 


1 




5 




10 




15 . 


A^Arg^Val iThr. -.Gly^^ Arg^Pro^^His'%rg-- Val Arg-. 


Pro Arg Cys 


Ala: Thr. - Ala 




•20 




25 






30 


^?>"Sef^Ser- Ala' Thr 


Glu Thr- - Pro-Ala Ala 


Pro 


Gly Val Arg 


Leu Ser Ala 


35 






40 




45 




>Glu^^^Gys 'Val 


lie 


Val Gly 


Ala Gly lie 


Ser 


Gly Leu Cys 


Thr Ala Gin 


-50 






55 




60 




Ala Leu Ala 


Thr 


Arg Tyr 


Gly Val Ser 


Asp 


Leu Leu Val 


Thr Glu Ala 


65 




70 






75 


80 


Arg Asp Arg 


Pro 


Gly Gly 


Asn He Thr 


Thr 


Val Glu Arg 


Pro Asp Glu 






85 




90 




95 


Gly Tyr Leu 


Trp 


Glu Glu 


Gly Pro Asn 


Ser 


Phe Gin Pro 


Ser Asp Pro 




100 




105 






110 


Val Leu Thr 


Met 


Ala Val 


Asp Ser Gly 


Leu 


Lys Asp Asp Leu Val Phe 


115 






120 




125 




Gly Asp Pro 


Asn 


Ala Pro 


Arg Phe Val 


Leu 


Trp Glu Gly Lys Leu Arg 


130 






135 




140 




Pro Val Pro 


Ser 


Lys Pro 


Gly Asp Leu 


Pro 


Phe Phe Ser 


Leu Met Ser 


145 




150 






155 


160 


lie Pro Gly Lys 


Leu Arg 


Ala Gly Leu 


Gly 


Ala Leu Gly 


He Arg Pro 






165 




170 




175 


Pro Pro Pro Gly 


Arg Glu 


Glu Ser Val 


Glu 


Glu Phe Val 


Arg Arg Asn 



180 185 190 
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Leu Gly Ala Glu Val Phe Glu Arg Leu He Glu Pro Phe Cys Ser Gly 

195 200 205 

Val Tyr Ala Gly Asp Pro Ser Lys Leu Ser Met Lys Ala Ala Phe Gly 

210 215 220 

Lys val Trp Arg Leu Glu Glu He Gly Gly Ser He He Gly Gly Thr 

230 235 
lie Lys Ala He Gin Asp Lys Gly Lys Asn Pro Lys Pro Pro Arg Asp 

245 250 255 

Pro Arg Leu Pro Ala Pro Lys Gly Gin Thr Val Ala Ser Phe Arg Lys 

260 265 270 

Gly Leu Ala Met Leu Pro Asn Ala He Ala Ser Arg Leu Gly Ser Lys 

275 280 285 

Val Lys Leu Ser Trp Lys Leu Thr Ser He Thr Lys Ala Asp Asn Gin 

290 295 300 

Gly Tyr Val Leu Gly Tyr Glu Thr Pro Glu Gly Leu Val Ser Val Gin 
305 310 315 320 

Ala Lys Ser Val He Met Thr He Pro Ser Tyr Val Ala Ser Asp He 

325 330 335 

Leu Arg Pro Leu Ser He Asp Ala Ala Asp Ala Leu Ser Lys Phe Tyr 

340 345 350 

Tyr Pro Pro Val Ala Ala Val Thr Val Ser Tyr Pro Lys Glu Ala He 

355 360 365 

Arg Lys Glu Cys Leu He Asp Gly Glu Leu Gin Gly Phe Gly Gin Leu 

370 375 380 

His Pro Arg Ser Gin Gly Val Glu Thr Leu Gly Thr He Tyr Ser Ser 
385 390 395 400 

ser Leu Phe Pro Asn Arg Ala Pro Ala Gly Arg Val Leu Leu Leu Asn 

405 410 415 

Tvr He Gly Gly Ser Thr Asn Thr Gly He Val Ser Lys Thr Glu Ser 

420 425 430 

Asp Leu Val Gly Ala Val Asp Arg Asp Leu Arg Lys Met Leu He Asn 

435 440 445 

pro Arg Ala Ala Asp Pro Leu Ala Leu Gly Val Arg Val Trp Pro Gin 

450 455 460 

Ala He Pro Gin Phe Leu He Gly His Leu Asp Arg Leu Ala Ala Ala 
465 470 475 480 

Lvs Ser Ala Leu Gly Gin Gly Gly Tyr Asp Gly Leu Phe Leu Gly Gly 

485 490 495 

Asn Tyr Val Ala Gly Val Ala Leu Gly Arg Cys He Glu Gly Ala Tyr 

500 505 510 

Glu Ser Ala Ser Gin Val Ser Asp Phe Leu Thr Lys Tyr Ala Tyr Lys 
515 520 525 



<210> 11 
<211> 1847 
<212> DWA 
<213> Glycine max 

<220> 

<221> CDS 

<222> (55).. (1683) 

<223> soybean protox-1 

<400> 11 

ctttagcaca gtgttgaaga taacgaacga 



atagtgccat tactgtaacc aacc atg 
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Met 
1 

gtt tec gtc ttc aac gag ate eta ttc ceg ccg aac caa ace ctt ctt 105 

Val Ser Val Phe Asn Glu lie Leu Phe Pro Pro Asn Gin Thr Leu Leu 

5 10 15 

cgc ccc tec etc cat tec cca ace tet ttc ttc aec tct cce aet cga 153 

Arg Pro Ser Leu His Ser Pro Thr Ser Phe Phe Thr Ser Pro Thr Arg 

20 25 30 

aaa ttc cct ege tct cge cct aac cct att eta cgc tgc tec att gcg 201 

Lys Phe Pro Arg Ser Arg Pro Asn Pro lie Leu Arg Cys Ser lie Ala 

35 40 45 

gag gaa tec acc gcg tct ccg ccc aaa aec aga gac tec gee cce gtg 249 

Glu Glu Ser Thr Ala Ser Pro Pro Lys Thr Arg Asp Ser Ala Pro Val 

50 55 60 65 

gac tgc gtc gtc gtc ggc gga ggc gtc age ggc etc tgc ate gee cag 297 

Asp 'Cys Val Val Val Gly Gly Gly Val Ser Gly. Leu Cys lie Ala Gin 

70 75 80 

gee ''etc gee accaaa^ cac ' gee aat gee aac gtc gtc gtc aeg gag gee 345 

Ala-Leu Ala Thr.vLys^ His Ala Asn Ala Asn Val Val Val Thr* Glu Ala 

85 90 95 

v^ega**gac ^cgc. gte gge /gge^ aae - atc ace aeg atg gag agg gac gga tac 393 

Arg^-Aspf.-'Arg-^Valr-Gly*^Gly"*rAsn ;Ide Thr Thr Met Glu Arg Asp Gly Tyr 

VlOO .105 110 

• ctc^^tgg-'gaaa^gaaJLggc -ecc- aac-age- ttc cag. cct tct gat cca atg etc 441 

*fLeunltp Glu-GluAGly ; Pro,i.Asn ^Ser'-Phe Gin Pro Ser Asp -Pro Met Leu 

,il5 .'120 125 

-^acc atg.gtg.-gtgvgac .agt ggt; tta aag gat. gag ctt gtt ttg ggg gat 489 

Thr Met Val Val Asp Ser Gly Leu Lys Asp Glu Leu Val Leu Gly Asp 

130 135 140 145 

cct gat gea cct egg ttt gtg ttg tgg aac agg aag ttg agg ccg gtg 537 

Pro Asp Ala Pro Arg Phe Val Leu Trp Asn Arg Lys Leu Arg Pro Val 

150 155 160 

ccc ggg aag ctg act gat ttg cct ttc ttt gac ttg atg age att ggt 585 

Pro Gly Lys Leu Thr Asp Leu Pro Phe Phe Asp Leu Met Ser lie Gly 

165 170 175 



ggc aaa ate agg get ggc ttt ggt gcg ctt gga att egg cct eet cct 
Gly Lys lie Arg Ala Gly Phe Gly Ala Leu Gly He Arg Pro Pro Pro 
180 185 190 



633 



cca ggt cat gag gaa teg gtt gaa gag ttt gtt egt egg aac ctt ggt 681 
Pro Gly His Glu Glu Ser Val Glu Glu Phe Val Arg Arg Asn Leu Gly 
195 200 205 

gat gag gtt ttt gaa egg ttg ata gag eet ttt tgt tea ggg gtc tat 729 
Asp Glu Val Phe Glu Arg Leu He Glu Pro Phe Cys Ser Gly Val Tyr 
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210 215 



220 225 



gca ggc gat cct tea aaa tta agt atg aaa gca gca ttc ggg aaa gtt 777 
Ala Gly Asp pro Ser Lys Leu Ser Met Lys Ala Ala Phe Gly Lys Val 

235 240 



230 



825 



873 



gga gga gca act aat act gga att tta teg aag acg gac agt gaa ctt 
Gly Gly Ala Thr Asn Thr Gly He Leu Ser Lys Thr Asp Ser Glu Leu 
435 440 445 



921 



969 



1017 



tag aag ctg gaa aaa aat ggt ggt age att att ggt gga act ttc aaa 
Trp Lys Leu Glu Lys Asn Gly Gly Ser He He Gly Gly Thr Phe Lys 

245 250 255 

gca ata caa gag aga aat gga get tea aaa eca ect ega gat ecg cgt 
Ala He Gin Glu Arg Asn Gly Ala Ser Lys Pro Pro Arg Asp Pro Arg 
260 265 270 

ctg eca aaa eca aaa ggt cag act gtt gga tct ttc egg aag gga ctt 
Leu Pro Lys Pro Lys Gly Gin Thr Val Gly Ser Phe Arg Lys Gly Leu 
275 280 285 

acc atg ttg cct gat gca att tct gee aga eta ggc aac aaa gta aag 
Thr Met Leu Pro Asp Ala He Ser Ala Arg Leu Gly Asn Lys Val Lys 
290 295 300 305 

tta tct tgg aag ctt tea agt att agt aaa ctg gat agt gga gag tac 
Leu -Ser Trp Lys Leu Ser Ser He Ser Lys Leu Asp Ser Gly Glu Tyr 

310 315 320 

agt ttg aea tat gaa aca eca gaa gga gtg gtt tct ttg cag tge aaa 1065 
Ser Leu Thr Tyr Glu Thr Pro Glu Gly Val Val Ser Leu Gin Cys Lys 

325 330 335 

act gtt gtc ctg acc att cct tec tat gtt get agt aea ttg ctg cgt 1113 
Thr Val Val Leu Thr He Pro Ser Tyr Val Ala Ser Thr Leu Leu Arg 
340 345 350 

cet ctg tct get get get gca gat gca ctt tea aag ttt tat tac cct 1161 
Pro Leu Ser Ala Ala Ala Ala Asp Ala Leu Ser Lys Phe Tyr Tyr Pro 
.355 360 365 

cea gtt get gca gtt tec ata tec tat cea aaa gaa get att aga tea 1209 
Pro Val Ala Ala Val Ser He Ser Tyr Pro Lys Glu Ala He Arg Ser 
370 375 380 385 

gaa tge ttg ata gat ggt gag ttg aag ggg ttt ggt caa ttg cat eca 1257 
Glu Cys Leu He Asp Gly Glu Leu Lys Gly Phe Gly Gin Leu His Pro 

390 395 400 

cgt age caa gga gtg gaa aca tta gga act ata tac age tea tea eta 1305 
Arg Ser Gin Gly Val Glu Thr Leu Gly Thr He Tyr Ser Ser Ser Leu 

405 410 415 

ttc ccc aac cga gca cea cct gga agg gtt eta etc ttg aat tac att 1353 
Phe Pro Asn Arg Ala Pro Pro Gly Arg Val Leu Leu Leu Asn Tyr He 
420 425 430 



1401 
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gtg gaa aca gtt gat cga gat ttg agg aaa ate ctt ata aac cca aat 1449 
Val Glu Tlir Val Asp Arg Asp Leu Arg Lys lie Leu He Asn Pro Asn 
450 455 460 465 

gcc cag gat cca ttt gta gtg ggg gtg aga ctg tgg act caa get att 1497 
Ala Gin Asp Pro Phe Val Val Gly Val Arg Leu Trp Pro Gin Ala He 

470 475 480 

cca cag ttc tta gtt gge cat ctt gat ctt eta gat gtt get aaa get 1545 
Pro Gin Phe Leu Val Gly His Leu Asp Leu Leu Asp Val Ala Lys Ala 

485 490 495 

tct ate aga aat act ggg ttt gaa ggg etc ttc ctt ggg ggt aat tat 1593 
Ser He Arg Asn Thr Gly Phe Glu Gly Leu Phe Leu Gly Gly Asn Tyr 
500 505 510 

gtg tct ggt gtt gee ttg gga cga tgc gtt gag gga gee tat gag gta 1641 
Val Ser Gly Val Ala Leu Gly Arg Cys Val Glu Gly Ala Tyr Glu Val 
515 520 525 

ligca.gct gaa gta aac gat ttt etc aca aat aga gtg tae aaa 1683 
• Ala '.Ala Glu Val -Asn Asp Phe Leu Thr Asn Arg Val Tyr Lys 

'^^530^ 535 540 

'«-feagtagcagt--ttttgttttt-'^gtggtggaat gggtgatggg aetctcgtgt tecattgaat 1743 
tataafeaatgMtgaaagtttc- tcaaattcgt tegataggtt tttggcggct tctattgetg 1803 
^?>ataat~gtaaa, ateetettta'.-agtttgaaaa aaaaaaaaaa aaaa 1847 



^2M>^543 
-r<2<b2»t><PRT 

:vc2a-3>^Glycine max * . , 

t<400> 12 

# Met: Val Ser Val Phe Asn Glu He Leu Phe Pro Pro Asn Gin Thr Leu 
15 10 15 

Leu Arg Pro Ser Leu His Ser Pro Thr Ser Phe Phe Thr Ser Pro Thr 

20 25 30 

Arg Lys Phe Pro Arg Ser Arg Pro Asn Pro He Leu Arg Cys Ser He 
35 40 45 

Ala Glu Glu Ser Thr Ala Ser Pro Pro Lys Thr Arg Asp Ser Ala Pro 
50 55 60 

Val Asp Cys Val Val Val Gly Gly Gly Val Ser Gly Leu Cys He Ala 
65 70 75 80 

Gin Ala Leu Ala Thr Lys His Ala Asn Ala Asn Val Val Val Thr Glu 

85 90 95 

Ala Arg Asp Arg Val Gly Gly Asn He Thr Thr Met Glu Arg Asp Gly 
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100 



105 



110 



Tyr Leu Trp Glu Glu Gly Pro Asn Ser Phe Gin Pro Ser Asp Pro Met 
115 120 125 

Leu Thr Met Val Val Asp Ser Gly Leu Lys Asp Glu Leu Val Leu Gly 
130 135 140 

Asp Pro Asp Ala Pro Arg Phe Val Leu Trp Asn Arg Lys Leu Arg Pro 
145 150 155 160 

Val Pro Gly Lys Leu Thr Asp Leu Pro Phe Phe Asp Leu Met Ser He 

165 170 175 

Gly Gly Lys He Arg Ala Gly Phe Gly Ala Leu Gly He Arg Pro Pro 

180 185 190 

Pro Pro Gly His Glu Glu Ser Val Glu Glu Phe Val Arg Arg Asn Leu 
195 200 205 

Gly Asp Glu Val Phe Glu Arg Leu He Glu Pro Phe Cys Ser Gly Val 
210 215 220 

Tyr Ala Gly Asp Pro Ser Lys Leu Ser Met Lys Ala Ala Phe Gly Lys 
225 230 235 240 

Val Trp Lys Leu Glu Lys Asn Gly Gly Ser He He Gly Gly Thr Phe 

245 250 255 

Lys Ala He Gin Glu Arg Asn Gly Ala Ser Lys Pro Pro Arg Asp Pro 

260 265 270 

Arg Leu Pro Lys Pro Lys Gly Gin Thr Val Gly Ser Phe Arg Lys Gly 
275 280 285 

Leu Thr Met Leu Pro Asp Ala He Ser Ala Arg Leu Gly Asn Lys Val 
290 295 300 

Lys Leu Ser Trp Lys Leu Ser Ser He Ser Lys Leu Asp Ser Gly Glu 
305 310 315 320 

Tyr Ser Leu Thr Tyr Glu Thr Pro Glu Gly Val Val Ser Leu Gin Cys 

325 330 335 

Lys Thr Val Val Leu Thr He Pro Ser Tyr Val Ala Ser Thr Leu Leu 

340 345 350 

Arg Pro Leu Ser Ala Ala Ala Ala Asp Ala Leu Ser Lys Phe Tyr Tyr 
355 360 365 

Pro Pro Val Ala Ala Val Ser He Ser Tyr Pro Lys Glu Ala He Arg 
370 375 380 

Ser Glu Cys Leu He Asp Gly Glu Leu Lys Gly Phe Gly Gin Leu His 
385 390 395 400 

Pro Arg Ser Gin Gly Val Glu Thr Leu Gly Thr He Tyr Ser Ser Ser 
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405 



410 



415 



Leu Phe Pro Asn Arg Ala Pro Pro Gly Arg Val Leu Leu Leu Asn Tyr 

420 425 430 

lie Gly Gly Ala Thr Asn Thr Gly lie Leu Ser Lys Thr Asp Ser Glu 
435 440 445 

Leu Val Glu Thr Val Asp Arg Asp Leu Arg Lys lie Leu lie Asn Pro 
450 455 460 

Asn Ala Gin Asp Pro Phe Val Val Gly Val Arg Leu Trp Pro Gin Ala 
465 470 475 480 

lie Pro Gin Phe Leu Val Gly His Leu Asp Leu Leu Asp Val Ala Lys 

485 490 495 

Ala Ser lie Arg Asn Thr Gly Phe Glu Gly Leu Phe Leu Gly Gly Asn 

500 505 510 

Tyr Val Ser Gly Val Ala Leu Gly Arg Cys Val Glu Gly Ala Tyr Glu 
515 520 525 



Val» Ala Ala Glu Val Asn Asp Phe Leu Thr Asn Arg Val Tyr Lys 
.530 535 540 



T'<211>;r.583 . 
--■K2<L5=#BNA 

^ ^ <2 1-3 >:^Ar abidops i s -^^ ttia 1 iana 

V^<;2'20> 

21>^.mis c^f-ea t-ur e 
^|<22^>•l,^(.l) . ; e583) 

-<2'2'3>^arabidopsis protox-1 promoter 
f40,0> 13 

gaattccgat cgaattatat aattatcata aatttgaata agcatgttgc cttttattaa 60 

agaggtttaa taaagtttgg taataatgga ctttgacttc aaactcgatt ctcatgtaat 120 

taattaatat ttacatcaaa atttggtcac taatattacc aaattaatat actaaaatgt 180 

taattcgca.a ataaaacact aattccaaat aaagggtcat tatgataaac acgtattgaa 240 

cttgataaag caaagcaaaa ataatgggtt tcaaggtttg ggttatatat gacaaaaaaa 300 

aaaaaaggtt tggttatata tctattgggc ctataaccat gttatacaaa tttgggccta 360 

actaaaataa taiaaataaac gtaatggtcc tttttatatt tgggtcaaac ccaactctaa 420 

acccaaacca aagaaaaagt atacggtacg gtacacagac ttatggtgtg tgtgattgca 480 

ggtgaatatt tctcgtcgtc ttctcctttc ttctgaagaa gattacccaa tctgaaaaaa 540 

accaagaagc tgacaaaatt ccgaattctc tgcgatttcc atg 583 



<210> 14 
<211> 3848 
<212> DNA 
<213> Zea mays 

<220> 
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<221> misc_f eature 

<222> (1) . . (3848) 

<223> maize protox-1 promoter 



60 



180 
240 
300 



<400> 14 

tcgatctttc taggctgatc cccaaatctt cctccgaagc ccctggcgcc tctgcccctt 
ggagctggtg gcctgaaaga gctttgctgt tgccccgaag attgtgaggt atattgtgac 120 
ctctgagact gacttccttt gtcgtcactt tgagtggagt tatggattga cctgacgtgc 
ctcagatgga ttcttcctcc gaagcccctg gtcatttcgg agaatctgta atcttattcc 
cttctttggc gaaaatctgt cagcttggat gtactcatcc atcttctgaa gcagcttctc 
cagagtttgt ggaggcttcc tggcgaaata ttgggctgta ggtcctggac gaagaccctt 360 
gatcatggcc tcaatgacaa tctcattggg caccgtaggc gcttgtgccc tcaatcgcaa 420 
gaaccttcgt acatatgcct gaaggtattc ttcgtgatct tgtgtgcatt ggaacagagc 480 
ctgagctgtg accgacttcg tttgaaagcc ttggaagcta gtaaccaaca tgtgcttaag 540 
cttctgccac gacgtgatag tccctggccg aagagaagaa taccatgttt gggctacatt 600 
ccggactgcc atgacgaagg acttcgccat gactacagtg ttgaccccat acgaagatat 660 
agttgcttcg tagctcatca gaaactgctt tggatctgag tgcccatcat acatggggag 720 
ctgaggtggc ttgtatgatg ggggccatgg ggtagcctgc agttctgctg ccaagggaga 780 
agcatcatca aaagtaaagg catcatgatt aaaatcatca taccatccat cctcgttgaa 840 
taagccttct tgacgaagct ccctgtgttg gggccttcga tcttgttcat cttgaacaag 900 
atgacgcact tcttcagtgg cttcgtcgat ctttctttgg agatcagcca gtcgcaccat 960 
cttctccttc tttctttgta cttgttgatg gatgatctcc atgtccctga tctcttggtc 1020 
caactcctcc tcttggagtg tcagactggt ggctttcctc ttctggcttc gagcctctcg 1080 
aagagaaaga gtttcttgat ttgggtccag cggctgcagt gcagtggtcc ctggtgctga 1140 
agctttcttc ggtggcatga caaaggtcag tgcttgccga aggtggtcga aaagggttca 1200 
ctagaggtgg gagccaatgt tggggacttc tcaagtgcta tgagttaaga acaaggcaac 1260 
acaaaatgtt aaatattaat agctttcatc tttcgaagca ttatttccct ttgggtataa 1320 
tgatcttcag acgaaagagt ccttcatcat tgcgatatat gttaatagaa ggaggagcat 1380 
atgaaatgta agagacaaca tgaacaatcg tgtagcattg ttaattcatc atcattttat 1440 
tattatggaa aaatagaaac aatattgaat tacaaatgta cctttggctt gacagaagat 1500 
aaaagtacaa gcttgacgca cgagcaagta caagtcagtg tgaacagtac gggggtactg 1560 
ttcatctatt tataggcaca ggacacagcc tgtgagaaat tacagtcatg ccctttacat 1620 
ttactattga cttatagaaa aatctatgag gactggatag ccttttcccc tttaagtcgg 1680 
tgcctttttc cgcgattaag ccgaatctcc cttgcgcata gcttcggagc atcggcaacc 1740 
ttcgtcacga tcatgccctt ctcattgtgt atgcttttaa tcctgaattc gaaggtacct 1800 
gtccataaac catacttgga agacattgtt aaattatgtt tttgaggacc ttcggaggac 1860 
gaaggccccc aacagtcgtg tttttgagga ccttcggaag atgaaggccc ccaacaagac 1920 
ctatccataa aaccaaccta tccacaaaac cgaccccatt cacccttcat ttgcctcacc 1980 
aacaacccta attaggttgt tggtttaaat tttttagggt caatttggtc atcaccatcc 2040 
actgtcactc cacaaactca atatcaataa acagactcaa tcacccaaac tgaccatacc 2100 
cataaaaccg ccccaccctt ctagcgcctc gccagaaacc agaaaccctg attcagagtt 2160 
caaacttaaa acgaccataa ctttcacctt ggaactcgaa tcaggtccat ttttttccaa 2220 
atcacacaaa attaaatttc gcatccgata atcaagccat ctcttcacta tggttttaag 2280 
tgttgctcac actagtgtat ttatggacta atcacctgtg tatctcatac aataacatat 2340 
cagtacatct aagttgttac tcaattacca aaaccgaatt atagccttcg aaaaaggtta 2400 
tcgactagtc actcaattac caaaactaaa ctttagactt tcatgtatga catccaacat 2460 
gacactgtac tggactaaac cacctttcaa gctacacaag gagcaaaaat aactaatttt 2520 
cgtagttgta ggagctaaag tatatgtcca caacaatagt taagggaagc ccccaaggac 2580 
ttaaaagtcc ttttacctct tgaaactttt gtcgtggtct actttttcac tttaaacttc 2640 
aaaatttgac attttatcac cccttaactc ttaaaaccat ttaaattaca ttcttactag 2700 
attatagatg attttgttgt gaaaagtttt taagacatgt ttacacattg attaaaatca 2750 
tttgttcaat ttcctagagt taaatctaat cttattaaaa ctattagaga tactttcacg 2820 
agctctaaat atttttattt tttcattatg gaattttgtt agaattctta tagacctttt 2880 
tttgtggttt aaaagccttg ccatgttttt aacaagtttt ttttctattt tttgaaattt 2940 
tcttggaaac cacttctaac ccggtagaag atttattttg ctacacttat atctacaaca 3000 
aaatcaactt atgaaattgt cttggaaact acctctaacc cggtagaatg aatttgaatg 3060 
aaaattaaac caacttacgg aatcgcccaa catatgtcga ttaaagtgga tatggataca 3120 
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tatgaagaag ccctagagat aatctaaatg gtttcagaat tgagggttat tttttgaagt 3180 

ttgatgggaa gataagacca taacggtagt tcacagagat aaaagggtta tttttttcag 3240 

aaatatttgt gctgcaattg atcctgtgcc tcaaattcag cctgcaacca aggccaggtt 3300 

ctagagcgaa caaggcccac gtcacccgtg gcccgtcagg cgaagcaggt cttgtgcaga 3360 

ctttgagagg gattggatat caacggaacc aatcacgcac ggcaatgcga ttcccagccc 342 0 

acctgtaacg ttccagtggg ccatccttaa ctccaagccc aacggcccta ccccatctcg 3480 

tcgtgtcatc cactccgccg cacaggcgct cagctccgca acgccgccgg aaatggtcgc 3540 

cgccacagcc accgccatgg ccaccgctgc atcgccgcta ctcaacggga cccgaatacc 3600 

tgcgcggctc cgccatcgag gactcagcgt gcgctgcgct gctgtggcgg gcggcgcggc 3660 

cgaggcaccg gcatccaccg gcgcgcggct gtccgcggac tgcgttgtgg tgggcggagg 3720 

catcagtggc ctctgcaccg cgcaggcgct ggccacgcgg cacggcgtcg gggacgtgct 3780 

tgtcacggag gcccgcgccc gccccggcgg caacattacc accgtcgagc gccccgagga 3840 

agggtacc 3848 



<210> 15 
<211> 1826 
<212> DNA 

<213> Gossypium hirsutum 

■'-<220> 
. .<221>' CDS 
<222> (-31) . . (1647) 

"<223> cotton protox-1 -coding- sequence 
.<400> 15 

. cctctcgctC' gcctggcccc-accaccaatcv atg acg get eta ate gac ctt tet 54 

-Met Thr Ala Leu lie- Asp Leu Ser 
1 5 

4:ctt. ictG^^cgt*.^tcc\itcg ,c.cc tccrsg^^ ttc tee ata rccc cac cac 102 

i^^Iieu -Leu^^-Arg^SerNSer Pro' Ser^-ValVSer>^Pro Phe Ser lie Pro His His 
^10 15 20 • 

t*cag :cat^ccg ccc -egc' ttt cgt -aaa- cot- etc cga tgc tec- etc 150 

- Gln7;'HisrPro Pro-Arg^:Phe -Arg.-'Lys: Pro^^Phe Lys Leu Arg Cys Ser Leu 
•25 30 35 40 

gcc gag iggt ccc acg att tec tea tct aaa ate gac ggg gga gaa tea 198 
Ala Glu.vGly Pro Thr lie Ser Ser Ser Lys lie Asp Gly Gly Glu Ser 

45 50 55 

tec ate gcg gat tgc gtc ate gtt gga ggt ggt ate agt gga ett tgc 246 
Ser lie Ala Asp Cys Val lie Val Gly Gly Gly lie Ser Gly Leu Cys 

60 65 70 

att get caa get etc gee acc aag cac cgt gac gtc get tec aat gtg 294 
He Ala Gin Ala Leu Ala Thr. Lys His Arg Asp Val Ala Ser Asn Val 
75 80 85 

att gtg acg gag gcc aga gac cgt gtt ggt gge aac ate act ace gtt 342 
He Val Thr Glu Ala Arg Asp Arg Val Gly Gly Asn He Thr Thr Val 
90 95 100 

gag aga gat gga tat ctg tgg gaa gaa ggc cce aae agt ttt eag eec 390 
Glu Arg Asp Gly Tyr Leu Trp Glu Glu Gly Pro Asn Ser Phe Gin Pro 
105 110 115 120 
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tec gat cct att eta ace atg gcc gtg gat agt gga ttg aag gac gat 438 
Ser Asp Pro lie Leu Thr Met Ala Val Asp Ser Gly Leu Lys Asp Asp 

125 130 135 

ttg gtt tta ggt gac cct aat gca ccg cga ttt gta eta tgg gag gga 486 
Leu Val Leu Gly Asp Pro Asn Ala Pro Arg Phe Val Leu Trp Glu Gly 

140 145 150 

aaa eta agg cct gtg ccc tec aag cca acc gac ttg ccg ttt ttt gat 534 
Lys Leu Arg Pro Val Pro Ser Lys Pro Thr Asp Leu Pro Phe Phe Asp 
155 160 165 

ttg atg age att get gga aaa ctt agg get ggg ttc ggg get att ggc 582 
Leu Met Ser lie Ala Gly Lys Leu Arg Ala Gly Phe Gly Ala lie Gly 
170 175 180 

att egg cct cec cct ccg ggt tat gaa gaa teg gtg gag gag ttt gtg 630 
He Arg Pro Pro Pro Pro Gly Tyr Glu Glu Ser Val Glu Glu Phe Val 
185 190 195 200 

cgc cgt aat ctt ggt get gag gtt ttt gaa egc ttt att gaa cca ttt. 678 
Arg Arg Asn Leu Gly Ala Glu Val Phe Glu Arg Phe He Glu Pro Phe 

205 210 215 

tgt tea ggt gtt tat* gca ggg gat cct tea aaa tta age atg aaa gca 726 
Cys Ser Gly Val Tyr Ala Gly Asp Pro Ser Lys Leu Ser Met Lys Ala 

220 225 230 

gca ttt gga aga gta tgg aag eta gaa gag att ggt ggc age ate att 774 
Ala Phe Gly Arg Val Trp Lys Leu Glu Glu He Gly Gly Ser He He 
235 240 245 

ggt ggc act ttc- aag aca ate cag gag aga aat aag aca cct aag cca 822 
Gly Gly Thr Phe Lys Thr He Gin Glu Arg Asn Lys Thr Pro Lys Pro 
250 255 260 

ecc aga gac ccg cgt ctg cca aaa ccg aag ggc caa aca gtt gga tct 870 
Pro Arg Asp Pro Arg Leu Pro Lys Pro Lys Gly Gin Thr Val Gly Ser 
265 270 275 280 

ttt agg aag gga ctt acc atg ctg cct gag gca att get aae agt ttg 918 
Phe Arg Lys Gly Leu Thr Met Leu Pro Glu Ala He Ala Asn Ser Leu 

285 290 295 

ggt age aat gta aaa tta tet tgg aag ctt tec agt att acc aaa ttg 966 
Gly Ser Asn Val Lys Leu Ser Trp Lys Leu Ser Ser He Thr Lys Leu 

300 305 310 

ggc aat gga ggg tat aae ttg aca ttt gaa aca cct gaa gga atg gta 1014 
Gly Asn Gly Gly Tyr Asn Leu Thr Phe Glu Thr Pro Glu Gly Met Val 
315 320 325 

tct ctt cag agt aga agt gtt gta atg acc att cca tee cat gtt gcc 1062 
Ser Leu Gin Ser Arg Ser Val Val Met Thr He Pro Ser His Val Ala 
330 335 340 
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agt aac ttg ttg cat cct etc teg get get get gca gat gca tta tec 1110 
Ser Asn Leu Leu His Pro Leu Ser Ala Ala Ala Ala Asp Ala Leu Ser 
345 350 355 360 

caa ttt tat tat cct cca gtt gea tea gtc aea gtc tec tat cca aaa 1158 
Gin Phe Tyr Tyr Pro Pro Val Ala Ser Val Thr Val Ser Tyr Pro Lys 

365 370 375 

gaa gee att cga aaa gaa tgt ttg att gat ggt gaa ctt aag ggg ttt 1206 
Glu Ala lie Arg Lys Glu Cys Leu lie Asp Gly Glu Leu Lys Gly Phe 

380 385 390 

ggc cag ttg cac cca cgc age caa gga att gaa act tta ggg acg ata 1254 
Gly Gin Leu His Pro Arg Ser Gin Gly lie Glu Thr Leu Gly Thr lie 
395 400 405 

tac agt tea tea ctt ttc ccc aat cga get cca tct ggc agg gtg ttg 1302 
Tyr Ser Ser Ser Leu Phe Pro Asn Arg Ala Pro Ser Gly Arg Val Leu 
410 415 420 

ctc""^ttg aac tac ata gga gga get acc aac act gga att ttg tec aag 1350 
Leu- Leu Asn Tyr He Gly Gly Ala Thr Asn Thr Gly He Leu Ser Lys 
425 430 435 440 

act gaa ggg gaa ctt^gta gaa gca gtt gat cgt gat ttg aga aaa atg 1398 
Thrv^Glu Gly Glu Leu Val Glu Ala Val Asp Arg Asp Leu Arg Lys Met 

445 450 455 

.*--cfct?^^^ata''>aat^cct aat.igcaraag gat cct ctt gtt ttg ggt gta aga gta 1446 
;^Leu^Ile>Asn'>Pro-AsnvAaa: Lys Asp Pro Leu Val Leu Gly Val Arg Val 

^A60 465 470 

-^tgsr^'Cea 5daa4^gec^att ccaTtcag ttc- ttg gtt ggt cat ttg gat etc ctt 1494 
-^iTrp^Pro .:.Lys- AO.a Ile^^^Pro ^Gln Phe Leu Val Gly His - Leu . Asp , Leu - Leu 
• 4r75 480 485 

- gat.*;agt gca aaa atg get* etc agg- gat tct ggg ttt cat gga ctg ,ttt 1542 
AspnSer Ala Lys Met Ala Leu Arg Asp Ser Gly Phe His Gly Leu Phe 
.490 495 500 

ctt ggg ggc aac tat gta tct ggt gtg gca tta gga egg tgt gtg gaa 1590 
Leu Gly Gly Asn Tyr Val Ser Gly Val Ala Leu Gly Arg Cys Val Glu 
505^ 510 515 520 

ggt get tac gag gtt gca get gaa gtg aag gaa ttc etg tea caa tat 1638 
Gly Ala Tyr Glu Val Ala Ala Glu Val Lys Glu Phe Leu Ser Gin Tyr 

525 530 535 

gca tac aaa taatattgaa attcttgtca ggctgcaaat gtagaagtca 1687 
Ala Tyr Lys 

gttattggat agtatctctt tagctaaaaa attgggtagg gttttttttg ttagttcctt 1747 
gaccactttt tggggtttte attagaactt catatttgta tateatgttg caatatcaaa 1807 
aaaaaaaaaa aaaaaaaaa 1826 
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<210> 16 
<211> 539 
<212> PRT 

<213> Gossypium hirsutum 
<400> 16 

Met Thr Ala Leu He Asp Leu Ser Leu Leu Arg Ser Ser Pro Ser Val 
X 5 10 15 

Ser Pro Phe Ser He Pro His His Gin His Pro Pro Arg Phe Arg Lys 

20 25 30 

Pro Phe Lys Leu Arg Cys Ser Leu Ala Glu Gly Pro Thr He Ser Ser 
35 40 45 

Ser Lys He Asp Gly Gly Glu Ser Ser He Ala Asp Cys Val He Val 
50 55 60 

Gly Gly Gly He Ser Gly Leu Cys He Ala Gin Ala Leu Ala Thr Lys 
65 70 75 80 

His Arg Asp Val Ala Ser Asn Val He Val Thr Glu Ala Arg Asp Arg 

85 90 95 

Val Gly Gly Asn He Thr Thr Val Glu Arg Asp Gly Tyr Leu Trp Glu 

100 105 110 

Glu Gly Pro Asn Ser Phe Gin Pro Ser Asp Pro He Leu Thr Met Ala 
115 120 125 

Val Asp Ser Gly Leu Lys Asp Asp Leu Val Leu Gly Asp Pro Asn Ala 
130 135 140 

Pro Arg Phe Val Leu Trp Glu Gly Lys Leu Arg Pro Val Pro Ser Lys 
145 150 155 160 

Pro Thr Asp Leu Pro Phe Phe Asp Leu Met Ser He Ala Gly Lys Leu 

165 170 175 

Arg Ala Gly Phe Gly Ala He Gly He Arg Pro Pro Pro Pro Gly Tyx 

180 185 190 

Glu Glu Ser Val Glu Glu Phe Val Arg Arg Asn Leu Gly Ala Glu Val 
195 200 205 

Phe Glu Arg Phe He Glu Pro Phe Cys Ser Gly Val Tyr Ala Gly Asp 
210 215 220 

Pro Ser Lys Leu Ser Met Lys Ala Ala Phe Gly Arg Val Tirp Lys Leu 
225 230 235 240 

Glu Glu He Gly Gly Ser He He Gly Gly Thr Phe Lys Thr He Gin 

245 250 255 

Glu Arg Asn Lys Thr Pro Lys Pro Pro Arg Asp Pro Arg Leu Pro Lys 

260 265 270 
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Pro Lys Gly Gin Ttir Val Gly Ser Phe Arg Lys Gly Leu Thr Met Leu 
275 280 285 

Pro Glu Ala lie Ala Asn Ser Leu Gly Ser Asn Val Lys Leu Ser Trp 
290 295 300 

Lys Leu Ser Ser lie Thr Lys Leu Gly Asn Gly Gly Tyr Asn Leu Thr 
305 310 315 320 

Phe Glu Thr Pro Glu Gly Met Val Ser Leu Gin Ser Arg Ser Val Val 

325 330 335 

Met Thr lie Pro Ser His Val Ala Ser Asn Leu Leu His Pro Leu Ser 

340 345 350 

Ala Ala Ala Ala Asp Ala Leu Ser Gin Phe Tyr Tyr Pro Pro Val Ala 
355 360 365 

Ser Val Thr Val Ser Tyr Pro Lys Glu Ala lie Arg Lys Glu Cys Leu 
370 375 380 

.I6.e^Asp.>.Gly >Glu-Leu Lys .Gly Phe Gly- Gin Leu His Pro Arg Ser Gin 
385 3'90 395 400 

Gly lie Glu Thr- .'Leu -Gly Thr lie Tyr Ser Ser Ser Leu Phe Pro Asn 

':405 410 415 

,,^^Arg;^AlaVPro*.:^Ser- Gly^ Arg' Valri>Leu Leu s-Leu Asn Tyr . lie Gly Gly Ala 

*^420 425 430 

'^ThrfTfAsn'^Thr^Gly/ Ile- Leu-*'^Ser vLys T Glu Leu Val -Glu Ala 

. ^435 fmO 445 

WalvAsp Arg =Asp-Leu"-^Arg-Lys>* Met Leu- He Asn Pro Asn Ala Lys Asp 
:450 .'455 '460 

Pro Leu Val Leu Gly Val Arg Val Trp Pro Lys Ala He Pro Gin Phe 
465 470 475 480 

Leu Val Gly His Leu Asp Leu Leu Asp Ser Ala Lys Met Ala Leu Arg 

485 490 495 

Tvsp Ser Gly Phe His Gly Leu Phe Leu Gly Gly Asn Tyr Val Ser Gly 

500 505 . 510 

Val Ala Leu Gly Arg Cys Val Glu Gly Ala Tyr Glu Val Ala Ala Glu 
515 520 525 

Val Lys Glu Phe Leu Ser Gin Tyr Ala Tyr Lys 
530 535 



<210> 17 
<211> 1910 
<212> DNA 
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<213> Beta vulgaris 

<220> 

<221> CDS 

<222> (1) . . (1680) 

<223> sugar beet protox-1 coding sequence 
<400> 17 

atg aaa tea atg gcg tta tea aac tgc att cca eag aca cag tgc atg 48 
Met Lys Ser Met Ala Leu Ser Asn Cys lie Pro Gin Thr Gin Cys Met 
15 10 15 

cea ttg cge age age ggg cat tae agg ggt aat tgt ate atg ttg tea 96 
Pro Leu Arg Ser Ser Gly His Tyr Arg Gly Asn Cys lie Met Leu Ser 

20 25 30 

att cca tgt agt tta att gga aga cga ggt tat tat tea cat aag aag 144 
lie Pro Cys Ser Leu lie Gly Arg Arg Gly Tyr Tyr Ser His Lys Lys 
35 40 45 

agg agg atg age atg agt tgc age aca age tea ggc tea aag tea gcg 192 
Arg Arg Met Ser Met Ser Cys Ser Thr Ser Ser Gly Ser Lys Ser Ala 
50 55 60 

gtt aaa gaa gca gga tea gga tea ggt gea gga gga ttg eta gae tgc 240 
Val Lys Glu Ala Gly Ser Gly Ser Gly Ala Gly Gly Leu Leu Asp Cys 
65 70 75 80 

gta ate gtt gga ggt gga att age ggg ett tgc ate gcg cag get ett 288 
Val lie Val Gly Gly Gly He Ser Gly Leu Cys He Ala Gin Ala Leu 

85 90 95 

tgt aca aaa cac tec tct tec tet tta tec cea aat ttt ata gtt aca 336 
Cys Thr Lys His Ser Ser Ser Ser Leu Ser Pro Asn Phe He Val Thr 

100 105 110 

gag gcc aaa gae aga gtt ggc gge aac ate gtc act gtg gag gee gat 384 
Glu Ala Lys Asp Arg Val Gly Gly Asn He Val Thr Val Glu Ala Asp 
115 120 125 

ggc tat ate tgg gag gag gga ecc aat age ttc cag cct tec gae gcg 432 
Gly Tyr He Trp Glu Glu Gly Pro Asn Ser Phe Gin Pro Ser Asp Ala 
130 135 140 

gtg etc acc atg gcg gtc gae agt ggc ttg aaa gat gag ttg gtg etc 480 
Val Leu Thr Met Ala Val Asp Ser Gly Leu Lys Asp Glu Leu Val Leu 
145 150 155 160 

gga gat cce aat get cct cgc ttt gtg eta tgg aat gae aaa tta agg 528 
Gly Asp Pro Asn Ala Pro Arg Phe Val Leu Trp Asn Asp Lys Leu Arg 

165 170 175 

cce gta cct tee agt etc acc gae etc cct ttc ttc gae etc atg acc 576 
Pro Val Pro Ser Ser Leu Thr Asp Leu Pro Phe Phe Asp Leu Met Thr 

180 185 190 

att ccg ggc aag att agg get get ett ggt get etc gga ttt cge cct 524 
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lie Pro Gly Lys lie Arg Ala Ala Leu Gly Ala Leu Gly Phe Arg Pro 
195 200 205 

tct cct cca cct cat gag gaa tct gtt gaa cac ttt gtg cgt cgt aat 672 
Ser Pro Pro Pro His Glu Glu Ser Val Glu His Phe Val Arg Arg Asn 
210 215 220 

etc gga gat gag gtc ttt gaa cgc ttg att gaa ccc ttt tgt tea ggt 720 
Leu Gly Asp Glu Val Phe Glu Arg Leu lie Glu Pro Phe Cys Ser Gly 
225 230 235 240 

gtg tat gcc ggt gat cct gcc aag ctg agt atg aaa get get ttt ggg 758 
Val Tyr Ala Gly Asp Pro Ala Lys Leu Ser Met Lys Ala Ala Phe Gly 

245 250 255 

aag gtc tgg aag ttg gag caa aag ggt ggc age ata att ggt ggc act 816 
Lys Val Trp Lys Leu Glu Gin Lys Gly Gly Ser lie lie Gly Gly Thr 

260 265 270 

etc aaa get ata eag gaa aga ggg agt aat eet aag ccg ccc cgt gac 864 
-^Leu 'Lys Ala lie Gin Glu Arg Gly Ser Asn Pro Lys Pro Pro Arg Asp 
275 280 285 

-eag^ege etc cct aaa- eea aag ggt cag act gtt gga tec ttt aga aag 912 
. Glnr^^Arg Leu Pro Lys Pro Lys Gly Gin Thr Val Gly Ser Phe Arg Lys 
-'290 295 .300 

gga;* etc;, gtt atg ttg cct acc, gee* att tct; get cga ett ggc agt aga 960 
• Gly.rLeu:-KVal;;Met^ Leu-Pro Thr Ala. Xle- Ser' Ala^ Arg Leu . Gly Ser Arg 
-305 ;310 .315 320 

ir gtg^aaai'-eta tct' tgg-: acc ett tet-^;agt ate -gta aag tea -etc aat gga 1008 
i^L-Val" iLys.^Leu;*^ Leu 'Ser:.-'Ser lie ^Val Lys- Ser Leu Asn Gly 

325 330 335 

;,^gaafi>Jfeat -agt ctg act" tat gat acc -eea gat ggc ttg gtt tct gta aga 1056 
Glu^ii;Tyr-^Ser Leu- Thr Tyr Asp Thr Pro Asp . Gly Leu Val Ser Val Arg 

340 345 350 

..acG.i,aaa agt gtt gtg atg act gtt eea tea tat gtt gea agt agg ctt 1104 
Thr Lys Ser Val Val Met Thr Val Pro Ser Tyr Val Ala Ser Arg Leu 
355 360 365 

ctt cgt cca ctt tea gac tet get gea gat tct ctt tea aaa ttt tac 1152 
Leu Arg Pro Leu Ser Asp Ser Ala Ala Asp Ser Leu Ser Lys Phe Tyr 
370 375 380 

tat cca cca gtt gea gea gtg tea ett tec tat eet aaa gaa gcg ate 1200 
Tyr Pro Pro Val Ala Ala Val Ser Leu Ser Tyr Pro Lys Glu Ala lie 
385 390 395 400 

aga tea gaa tgc ttg att aat ggt gaa ctt caa ggt ttc ggg eaa eta 1248 
Arg Ser Glu Cys Leu lie Asn Gly Glu Leu Gin Gly Phe Gly Gin Leu 

405 410 415 

cat ccc ege agt cag ggt gtg gaa ace ttg gga aca att tat agt teg 1296 
His Pro Arg Ser Gin Gly Val Glu Thr Leu Gly Thr He Tyr Ser Ser 
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420 425 430 

tct ctt ttc cct ggt cga gca cca cct ggt agg ate ttg ate ttg age 

Ser Leu Phe Pro Gly Arg Ala Pro Pro Gly Arg He Leu He Leu Ser 

435 440 445 



cct gat gca aaa ctt cct cgt gta ctg ggt gtg aga gta tgg cct caa 
Pro Asp Ala Lys Leu Pro Arg Val Leu Gly Val Arg Val Trp Pro Gin 

485 490 495 



1344 



tac ate gga ggt get aaa aat cct ggc ata tta aac aag teg aaa gat 1392 
Tyr He Gly Gly Ala Lys Asn Pro Gly He Leu Asn Lys Ser Lys Asp 
450 455 460 

gaa ctt gee aag aca gtt gae aag gac ctg aga aga atg ctt ata aat 1440 
Glu Leu Ala Lys Thr Val Asp Lys Asp Leu Arg Arg Met Leu He Asn 
465 470 475 480 



1488 



gca ata ccc cag ttt tct att ggg cac ttt gat ctg etc gat get gca 1536 
Ala He Pro Gin Phe Ser He Gly His Phe Asp Leu Leu Asp Ala Ala 

500 505 510 

aaa get get ctg aca gat aca ggg gte aaa gga ctg ttt ctt ggt ggc 1584 
Lys Ala Ala Leu Thr Asp Thr Gly Val Lys Gly Leu Phe Leu Gly Gly 
515 520 525 

aac tat gtt tea ggt gtt gcc ttg ggg egg tgt ata gag ggt get tat 1632 
Asn Tyr Val Ser Gly Val Ala Leu Gly Arg Cys He Glu Gly Ala Tyr 
530 535 540 

gag tct gca get gag gta gta gat ttc etc tea cag tac tea gac aaa 1680 
Glu Ser Ala Ala Glu Val Val Asp Phe Leu Ser Gin Tyr Ser Asp Lys 
545 550 555 560 

tagagcttca geatcctgtg taattcaaca eaggectttt tgtatetgtt gtgegegcat 1740 

gtagtctggt egtggtgcta ggattgatta gttgctctge tgtgtgatce acaagaattt 1800 

tgatggaatt ttteeagatg tgggeattat atgttgctgt cttataaatc cttaatttgt 1860 

acgtttagtg aattacaccg eatttgatga ctaaaaaaaa aaaaaaaaaa 1910 



<210> 18 
<211> 560 
<212> PRT 

<213> Beta vulgaris 
<400> 18 

Met Lys Ser Met Ala Leu Ser Asn Cys He Pro Gin Thr Gin Cys Met 
15 10 15 

Pro Leu Arg Ser Ser Gly His Tyr Arg Gly Asn Cys He Met Leu Ser 

20 25 30 

He Pro Cys Ser Leu He Gly Arg Arg Gly Tyr Tyr Ser His Lys Lys 
35 40 45 
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Arg Arg Met Ser Met Ser Cys Ser Thr Ser Ser Gly Ser Lys Ser Ala 
50 55 60 

Val Lys Glu Ala Gly Ser Gly Ser Gly Ala Gly Gly Leu Leu Asp Cys 
65 70 75 80 

Val He Val Gly Gly Gly He Ser Gly Leu Cys He Ala Gin Ala Leu 

85 90 95 

Cys Thr Lys His Ser Ser Ser Ser Leu Ser Pro Asn Phe He Val Thr 

100 105 110 

Glu Ala Lys Asp Arg Val Gly Gly Asn He Val Thr Val Glu Ala Asp 
115 120 125 

Gly Tyr He Trp Glu Glu Gly Pro Asn Ser Phe Gin Pro Ser Asp Ala 
130 135 140 

Val Leu Thr Met Ala Val Asp Ser Gly . Leu Lys Asp Glu Leu Val Leu 
145 150 155 160 

-Gly -Asp Pro- Asn Ala Pro Arg Phe Val Leu Trp Asn Asp -Lys Leu Arg 

.165 170 175 

-Pro-A^al-.^-Prq Ser Ser Leu Thr Asp Leu Pro Phe Phe Asp Leu Met Thr 

180 185 190 

^<'ileviPrd^Gly^jLys^;He.^^Arg-Ala -Ala Leu Gly Ala Leu Gly Phe Arg Pro 
^?1?95 200 205 

isSer'?^rPro?&Pro>^Pro59HiSMGlut'Glu.;Ser .Val Glu His -Phe -Val Arg Arg Asn 
^p%2ia .215 220 

-^Leus^Gly--*Asp.>Glu Val 'Phe Glu Arg Leu He Glu Pro Phe Cys-^ Ser Gly 
••i225 :230 235 240 

'nValnTyrt^Aia. Gly Asp Pro Ala Lys Leu Ser Met Lys Ala Ala Phe Gly 

245 250 255 

-Lys Val Tip Lys Leu Glu Gin Lys Gly Gly Ser He He Gly Gly Thr 

260 265 270 

Leu Lys Ala He Gin Glu Arg Gly Ser Asn Pro Lys Pro Pro Arg Asp 
275 280 285 

Gin Arg Leu Pro Lys Pro Lys Gly Gin Thr Val Gly Ser Phe Arg Lys 
290 295 300 

Gly Leu Val Met Leu Pro Thr Ala He Ser Ala Arg Leu Gly Ser Arg 
305 310 315 320 

Val Lys Leu Ser Trp Thr Leu Ser Ser He Val Lys Ser Leu Asn Gly 

325 330 335 

Glu Tyr Ser Leu Thr Tyr Asp Thr Pro Asp Gly Leu Val Ser Val Arg 

340 . 345 350 
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Thr Lys Ser Val Val Met Thr Val Pro Ser Tyr Val Ala Ser Arg Leu 
355 360 365 

Leu Arg Pro Leu Ser Asp Ser Ala Ala Asp Ser Leu Ser Lys Phe Tyr 
370 375 380 

Tyr Pro Pro Val Ala Ala Val Ser Leu Ser Tyr Pro Lys Glu Ala lie 
385 390 395 400 

Arg Ser Glu Cys Leu lie Asn Gly Glu Leu Gin Gly Phe Gly Gin Leu 

405 410 415 

His Pro Arg Ser Gin Gly Val Glu Thr Leu Gly Thr lie Tyr Ser Ser 

420 425 430 

Ser Leu Phe Pro Gly Arg Ala Pro Pro Gly Arg lie Leu lie Leu Ser 
435 440 445 

Tyr He Gly Gly Ala Lys Asn Pro Gly He Leu Asn Lys Ser Lys Asp 
450 455 460 

Glu Leu Ala Lys Thr Val Asp Lys Asp Leu Arg Arg Met Leu He Asn 
465 470 475 480 

Pro Asp Ala Lys Leu Pro Arg Val Leu Gly Val Arg Val Trp Pro Gin 

485 490 495 

Ala He Pro Gin Phe Ser He Gly His Phe Asp Leu Leu Asp Ala Ala 

500 505 510 

Lys Ala Ala Leu Thr Asp Thr Gly Val Lys Gly Leu Phe Leu Gly Gly 
515 520 525 

Asn Tyr Val Ser Gly Val Ala Leu Gly Arg Cys He Glu Gly Ala Tyr 
530 535 540 



Glu Ser Ala Ala Glu Val Val Asp Phe Leu Ser Gin Tyr Ser Asp Lys 
545 550 555 560 



<210> 19 
<211> 1784 
<212> DNA 

<213> Brassica napus 

<220> 

<221> CDS 

<222> (47).. (1654) 

<223> oilseed rape protox-1 coding sequence 
<400> 19 

gggccccccc caaaattgag gattctcctt ctcgcgggcg atcgcc atg gat tta 55 

Met Asp Leu 
1 
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tct ctt etc cgt ccg cag cca ttc eta teg oca tte tea aat eca ttt 103 
Ser Leu Leu Arg Pro Gin Pro Phe Leu Ser Pro Phe Ser Asn Pro Phe 
5 10 15 

cct egg teg cgt ecc tac aag ect etc aac etc cgt tgc tec gta tec 151 
Pro Arg Ser Arg Pro Tyr Lys Pro Leu .Asn Leu Arg Cys Ser Val Ser 
20 25 30 35 

ggt gga tec gtc gtc ggc tct tct aca ate gaa ggc gga gga gga ggt 199 
Gly Gly Ser Val Val Gly Ser Ser Thr lie Glu Gly Gly Gly Gly Gly 

40 45 50 

aaa ace gtc acg geg gac tgc gtg ate gtc ggc gga gga ate age ggc 247 
Lys Thr Val Thr Ala Asp Cys Val He Val Gly Gly Gly He Ser Gly 

55 60 65 

ctg tgc att geg caa geg etc gtg acg aag cac cca gac get gca aag 295 
Leu Cys He Ala Gin Ala Leu Val Thr Lys His Pro Asp Ala Ala Lys 
70 75 80 

aat gtg atg gtg acg gag geg aag gac cgt gtg gga ggg aat ate ate 343 
Asn Val Met Val Thr Glu Ala Lys Asp Arg Val Gly Gly Asn He He 
85 90 95 

-acg cga- gag- gag caa ggg ttt eta tgg gaa gaa ggt ecc aat age ttt 391 
Thr* Arg- Glu Glu^ Gin Gly Phe Leu Trp Glu Glu Gly Pro Asn Ser Phe 
100 105 110 115 

cag .ecg vtet^gat cct -atg etc -act atg gtg gta. gat agt ggt ttg aaa 439 
■^Gln-feiPro^^Ser^Aspv-Pro'-^^Met Leu^>Thr'*Met^ Val Val Asp Ser Gly Leu Lys 

-1^20 , 125 130 

i;gat5..gat':cta7.gtc -t.tgTv.gga gat-- cct -act. get ccg agg ttt gtg ttg tgg 487 
Asp^'Asp^Leur-Val iLeu Gly Asp vPro -.Thr Ala Pro Arg Phe Val Leu Trp 

135 -140 145 

^aat.uggg. aag -ctg agg-ccg gtt ccg teg aag eta act gac ttg cct ttc 535 
Asn Gly ^ Lys Leu. Arg Pro Val Pro Ser Lys Leu Thr Asp Leu Pro Phe 
150 155 160 

ttt ,gac ttg atg agt att gga ggg aag att aga get ggg ttt ggt gee 583 
Phe Asp Leu Met Ser He Gly Gly Lys He Arg Ala Gly Phe Gly Ala 
165 170 175 

att ggt att cga cct tea ect ccg ggt cgt gag gaa tea gtg gaa gag 631 
He Gly He Arg Pro Ser Pro Pro Gly Arg Glu Glu Ser Val Glu Glu 
180 185 190 195 

ttt gta agg cgt aat ett ggt gat gag gtt ttt gag cge ttg att gaa 679 
Phe Val Arg Arg Asn Leu Gly Asp Glu Val Phe Glu Arg Leu He Glu 

200 205 210 

cec ttt tgc tea ggt gtt tat geg gga gat ect geg aaa ctg agt atg 727 
Pro Phe Cys Ser Gly Val Tyr Ala Gly Asp Pro Ala Lys Leu Ser Met 

215 220 225 

aaa gea get ttt ggg aag gtt tgg aag eta gag gag aat ggt ggg age 775 
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Lys Ala Ala 
230 

ate att ggt 
lie He Gly 
245 

aag aca acc 
Lys Thr Thr 
260 

ggt tct ttc 
Gly Ser Phe 



agg ttg ggt 
Arg Leu Gly 



aag ctg gcc 
Lys Leu Ala 
310 

ata gtc act 
He Val Thr 
325 

gtt get agt 
Val Ala Ser 
340 

etc tea aaa 
Leu Ser Lys 



gcg aaa gaa 
Ala Lys Glu 



ggg ttc ggc 

Gly Phe Gly 
390 

aca ata tae 
Thr He Tyr 
405 

gta ttg eta 
Val Leu Leu 
420 

tea aag teg 
Ser Lys Ser 



aag atg ctg 
Lys Met Leu 



Phe Gly Lys 



ggt get ttt 
Gly Ala Phe 

ega gat ceg 
Arg Asp Pro 
265 

agg aaa gga 
Arg Lys Gly 
280 

gac aag gtg 
Asp Lys Val 
295 

age gga gaa 
Ser Gly Glu 



gta cag age 
Val Gin Ser 



agt etc ttg 
Ser Leu Leu 
345 

etc tac tat 
Leu Tyr Tyr 
360 

gca ate cga 
Ala He Arg 
375 

cag ttg cat 
Gin Leu His 



agt tea teg 
Ser Ser Ser 



ttg aac tac 
Leu Asn Tyr 
425 

gaa ggt gag 
Glu Gly Glu 
440 

ata aag cca 
He Lys Pro 



Val Trp Lys 
235 

aag gca att 
Lys Ala He 
250 

cgt ctg eca 
Arg Leu Pro 



etc aca atg 
Leu Thr Met 



aaa gtt tct 
Lys Val Ser 
300 

tat age tta 
Tyr Ser Leu 
• 315 

aaa agt gta 
Lys Ser Val 
330 

cgc cet etc 
Arg Pro Leu 



ceg eca gtt 
Pro Pro Val 



age gaa tgc 
Ser Glu Cys 
380 

cca ege acg 
Pro Arg Thr 
395 

etc ttt ecc 
Leu Phe Pro 
410 

ate ggt gga 
He Gly Gly 



tta gtg gaa 
Leu Val Glu 



age teg acc 
Ser Ser Thr 



Leu Glu Glu 



caa gcg aaa 
Gin Ala Lys 
255 

aag cca aag 
Lys Pro Lys 
270 

ctg cca gag 
Leu Pro Glu 
285 

tgg aag etc 
Trp Lys Leu 



act tac gaa 
Thr Tyr Glu 



gtg atg act 
Val Met Thr 
335 

tct gat tct 
Ser Asp Ser 
350 

gca gcc gta 
Ala Ala Val 
365 

tta ata gat 
Leu He Asp 



caa aaa gtg 
Gin Lys Val 



aac cga gca 
Asn Arg Ala 
415 

get ace aac 
Ala Thr Asn 
430 

gca gta gat 
Ala Val Asp 
445 

gat cca ctt 
Asp Pro Leu 



Asn Gly Gly 
240 

aat aaa get 
Asn Lys Ala 



ggc caa act 
Gly Gin Thr 



gca ate tec 
Ala He Ser 
290 

tea agt ate 
Ser Ser He 
305 

act ceg gag 
Thr Pro Glu 
320 

gtg cca tct 
Val Pro Ser 



gca get gaa 
Ala Ala Glu 



tec ate tea 
Ser He Ser 
370 

ggt gaa eta 
Gly Glu Leu 
385 

gaa act ctt 
Glu Thr Leu 
400 

ceg cet gga 
Pro Pro Gly 



act ggg ate 
Thr Gly He 



aga gac ttg 
Arg Asp Leu 
450 

gta ctt gga 
Val Leu Gly 



Ser 



cec 823 
Pro 



gtt 871 

Val 

275 

gca 919 
Ala 



act 967 
Thr 



ggt 1015 
Gly 



cat 1063 
His 



gcg 1111 

Ala 

355 

tac 1159 
Tyr 



aaa 1207 
Lys 



gga 1255 
Gly 



aga 1303 
Arg 



tta 1351 

Leu 

435 

agg 1399 
Arg 



gta 1447 
Val 
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455 



460 



465 



aaa tta tgg cct caa gcc att cct cag ttt ctg ata ggt cac att gat 
hys Leu Trp Pro Gin Ala lie Pro Gin Phe Leu lie Gly His lie Asp 
470 475 480 



1495 



ttg gta gac gca gcg aaa gca teg etc teg tea tct ggt cat gag ggc 
Leu Val Asp Ala Ala Lys Ala Ser Leu Ser Ser Ser Gly His Glu Gly 
485 490 495 



1543 



tta ttc ttg ggt gga aat tac gtt gcc ggt gta gca ttg ggt egg tgt 
Leu Phe Leu Gly Gly Asn Tyr Val Ala Gly Val Ala Leu Gly Arg Cys 
500 505 510 515 



1591 



gtg gaa ggt get tat gaa act gca acc caa gtg aat gat ttc atg tea 
Val Glu Gly Ala Tyr Glu Thr Ala Thr Gin Val Asn Asp Phe Met Ser 

520 525 530 



1639 



agg tat get tac aag taatgtaacg cagcaaegat ttgatactaa gtagtagatt 1694 
Arg Tyr Ala Tyr Lys 

535 

ttgcagtttt ..gactttaaga acactctgtt tgtgaaaaat tcaagtctgt gattgagtaa 1754 



"-att^tatgtat itattaetaaa aaaaaaaaaa 



1784 



. <21"0> *;20 
:.<2l 1=^^53 6 
21*2 >3iPRT 
><c2'S3 ^?|Br a s si cafan^ 



^Me t Asp^^Beu KSer.^Leuw^ >jPro Gin - Pro / Phe Leu - Ser : Pr o-^?Phe Ser 

-1 5 10 15 

t_i:;AsiitsPro :Phe:;Pro%i^ Tyr Lys Pro ^ Leu Asn LeuvArg Cys 

^20 ^ • 25 , 30 

,:Serr Val Ser Gly Gly Ser. Val Val Gly Ser Ser Thr lie Glu Gly Gly 
35 40 45 

Gly Gly Gly Lys Thr Val Thr Ala Asp Cys Val lie Val Gly Gly Gly 
50 55 60 

lie Ser Gly Leu Cys lie Ala Gin Ala Leu Val Thr Lys His Pro Asp 

65 70 75 80 

Ala Ala Lys Asn Val Met Val Thr Glu Ala Lys Asp Arg Val Gly Gly 

85 90 95 

Asn lie lie Thr Arg Glu Glu Gin Gly Phe Leu Trp Glu Glu Gly Pro 

100 105 110 



Asn Ser Phe Gin Pro Ser Asp Pro Met Leu Thr Met Val Val Asp Ser 
115 120 125 
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Gly Leu Lys Asp Asp Leu Val Leu Gly Asp Pro Thr Ala Pro Arg Phe 
130 135 140 

Val Leu Trp Asn Gly Lys Leu Arg Pro Val Pro Ser Lys Leu Thr Asp 
145 150 155 160 

Leu Pro Phe Phe Asp Leu Met Ser lie Gly Gly Lys lie Arg Ala Gly 

165 170 175 

Phe Gly Ala He Gly He Arg Pro Ser Pro Pro Gly Arg Glu Glu Ser 

180 185 190 

Val Glu Glu Phe Val Arg Arg Asn Leu Gly Asp Glu Val Phe Glu Arg 
195 200 205 

Leu He Glu Pro Phe Cys Ser Gly Val Tyr Ala Gly Asp Pro Ala Lys 
210 215 220 

Leu Ser Met Lys Ala Ala Phe Gly Lys Val Trp Lys Leu Glu Glu Asn 
225 230 235 240 

Gly Gly Ser He He Gly Gly Ala Phe Lys Ala He Gin Ala Lys Asn 

245 250 255 

Lys Ala Pro Lys Thr Thr Arg Asp Pro Arg Leu Pro Lys Pro Lys Gly 

260 265 270 

Gin Thr Val Gly Ser Phe Arg Lys Gly Leu Thr Met Leu Pro Glu Ala 
275 280 285 

He Ser Ala Arg Leu Gly Asp Lys Val Lys Val Ser Trp Lys Leu Ser 
290 295 300 

Ser He Thr Lys Leu Ala Ser Gly Glu Tyr Ser Leu Thr Tyr Glu Thr 
305 310 315 320 

Pro Glu Gly He Val Thr Val Gin Ser Lys Ser Val Val Met Thr Val 

325 330 335 

Pro Ser His Val Ala Ser Ser Leu Leu Arg Pro Leu Ser Asp Ser Ala 

340 345 350 

Ala Glu Ala Leu Ser Lys Leu Tyr Tyr Pro Pro Val Ala Ala Val Ser 
355 360 365 

He Ser Tyr Ala Lys Glu Ala He Arg Ser Glu Cys Leu He Asp Gly 
370 375 380 

Glu Leu Lys Gly Phe Gly Gin Leu His Pro Arg Thr Gin Lys Val Glu 
385 390 395 400 

Thr Leu Gly Thr He Tyr Ser Ser Ser Leu Phe Pro Asn Arg Ala Pro 

405 410 415 

Pro Gly Arg Val Leu Leu Leu Asn Tyr He Gly Gly Ala Thr Asn Thr 

420 425 430 
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Gly He Leu Ser Lys 
435 

Asp Leu Arg Lys Met 
450 

Leu Gly Val Lys Leu 
465 

His He Asp Leu Val 

485 

His Glu Gly Leu Phe 

500 

Gly Arg Cys Val Glu 
515 

Phe Met Ser Arg Tyr 
530 



Ser Glu Gly Glu Leu Val 
440 

Leu He Lys Pro Ser Ser 
455 

Trp Pro Gin Ala He Pro 
470 475 

Asp Ala Ala Lys Ala Ser 

490 

Leu Gly Gly Asn Tyr Val 

505 

Gly Ala Tyr Glu Thr Ala 
520 

Ala Tyr Lys 
535 



Glu Ala Val Asp Arg 
445 

Thr Asp Pro Leu Val 
460 

Gin Phe Leu He Gly 

480 

Leu Ser Ser Ser Gly 

495 

Ala Gly Val Ala Leu 
510 

Thr Gin Val Asn Asp 
525 



-<210>-21 

<211>^1224 
' <21'-2> -DNA 
^<2 16>*v0ryza.^sat-iva 

«^<220> , . . . 

:^^2d>^GDS ' . ' 

::^2:2<2>^(1 093 6) : . . 

2 26 > v^r i c e. .-pr o t ox^ Ivpart ialiiitc b ding , s equenc e 

;^4.00>^:pl' / ■ ■■ 

"-egg^gctr^ttg aag.^gct: gca ttt -ggg. aag gtg tgg agg ctg. gag gat act 48 
:.:iArg 'Ala;rLeu Lys^;Ala.;Ala Phe ; Gly tLys Val Trp Arg Leu j Glu Asp Thr 
1 5 10 15 

.gga ggt age att att ggt gga acc ate aag aea ate eag gag agg ggg 96 
Gly Gly Ser He He Gly Gly Thr He Lys Thr He Gin Glu Arg Gly 

20 25 30 

aaa aac ccc aaa ccg ccg agg gat ccc cgc ctt cca acg cca aag ggg 144 
Lys Asn Pro Lys Pro Pro Arg Asp Pro Arg Leu Pro Thr Pro Lys Gly 
35 40 45 

cag aca gtt gca tct ttc agg aag ggt ctg act atg etc ccg gat get 192 
Gin Thr Val Ala Ser Phe Arg Lys Gly Leu Thr Met Leu Pro Asp Ala 
50 55 60 

att aca tet agg ttg ggt age aaa gtc aaa ctt tea tgg aag ttg aca 240 
He Thr Ser Arg Leu Gly Ser Lys Val Lys Leu Ser Trp Lys Leu Thr 
65 70 75 80 

age att aea aag tea gac aac aaa gga tat gca tta gtg tat gaa aca 288 
Ser He Thr Lys Ser Asp Asn Lys Gly Tyi^ Ala Leu Val Tyr Glu Thr 

85 90 95 
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cca gaa ggg gtg gtc teg gtg caa get aaa act gtt gtc atg acc ate 336 
Pro Glu Gly Val Val Ser Val Gin Ala Lys Thr Val Val Met Thr lie 

100 105 110 

cca tea tat gtt get agt gat ate ttg egg eea ctt tea agt gat gca 384 
Pro Ser Tyr Val Ala Ser Asp He Leu Arg Pro Leu Ser Ser Asp Ala 
115 120 125 

gea gat get ctg tea ata ttc tat tat cca cca gtt get get gta act 432 
Ala Asp Ala Leu Ser He Phe Tyr Tyr Pro Pro Val Ala Ala Val Thr 
130 135 140 

gtt tea tat cca aaa gaa gea att aga aaa gaa tgc tta att gae gga 480 
Val Ser Tyr Pro Lys Glu Ala He Arg Lys Glu Cys Leu He Asp Gly 
145 150 155 160 

gag etc cag ggt ttc ggc cag ctg cat ccg cgt agt cag gga gtt gag 528 
Glu Leu Gin Gly Phe Gly Gin Leu His Pro Arg Ser Gin Gly Val Glu 

165 170 175 

act tta gga aca ata tat age tea tea etc ttt cca aat cgt get eea 576 
Thr Leu Gly Thr He Tyr Ser Ser Ser Leu Phe Pro Asn Arg Ala Pro 

180 185 • 190 

get gga agg gtg tta ctt ctg aae tae ata gga ggt tet aca aat aca 624 
Ala Gly Arg Val Leu Leu Leu Asn Tyr He Gly Gly Ser Thr Asn Thr 
195 200 205 

ggg att gtt tec aag act gaa agt gag ctg gta gaa gca gtt gae cgt 672 
Gly He Val Ser Lys Thr Glu Ser Glu Leu Val Glu Ala Val Asp Arg 
210 215 220 

gae etc agg aag atg ctg ata aat ect aga gea gtg gae cct ttg gtc 720 
Asp Leu Arg Lys Met Leu He Asn Pro Arg Ala Val Asp Pro Leu Val 
225 230 235 240 

ctt ggc gtc egg gta tgg eea eaa gee ata cca cag ttc etc att ggc 768 
Leu Gly Val Arg Val Trp Pro Gin Ala He Pro Gin Phe Leu He Gly 

245 250 255 

eat ctt gat eat ctt gag get gea aaa tet gee ctg ggc aaa ggt ggg 816 
His Leu Asp His Leu Glu Ala Ala Lys Ser Ala Leu Gly Lys Gly Gly 

260 265 270 

tat gat gga ttg ttc etc gga ggg aae tat gtt gca gga gtt gee ctg 864 
Tyr Asp Gly Leu Phe Leu Gly Gly Asn Tyr Val Ala Gly Val Ala Leu 
275 280 285 

ggc cga tgc gtt gaa ggt gea tat gag agt gee tea caa ata tet gae 912 
Gly Arg Cys Val Glu Gly Ala Tyr Glu Ser Ala Ser Gin He Ser Asp 
290 295 300 



tac ttg ace aag tac gee tac aag tgateaaagt tggcctgctc cttttggcae 966 
Tyr Leu Thr Lys Tyr Ala Tyr Lys 
305 310 
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atagatgtga ggcttctagc agcaaaaatt tcatgggcat ctttttatcc tgattctaat 1026 

tagttagaat ttagaattgt agaggaatgt tccatttgca gttcataata gttgttcaga 1086 

tttcagccat tcaatttgtg cagccattta ctatatgtag tatgatcttg taagtactac 1146 

taagaacaaa tcaattatat tttcctgcaa gtgacatctt aatcgtcagc aaatccagtt 1206 

actagtaaaa aaaaaaaa 1224 



<210> 22 
<211> 312 
<212> PRT 

<213> Oryza sativa 
<400> 22 

Arg Ala Leu Lys Ala Ala Phe Gly Lys Val Trp Arg Leu Glu Asp Thr 
15 10 15 

Gly ' Gly Ser lie lie -Gly Gly Thr He Lys Thr He Gin Glu Arg Gly 

20 ..--25 30 

Lys' -Asn Pro Lys ' Pro "Pro Arg Asp Pro Arg Leu Pro Thr Pro Lys Gly 

.'.35 .40 45 

Gln-'iThrv^Val; Alai^Ser ^^Phe . ^Arg - Lys Gly Leu'r Thr Met - Leu Pr o- ^ Asp Ala 

"^'^50 ^"'^-55 60 



^^i|,I^le?^^hr^Ser^Argr^^Leul^G ^Ser"' Trp -Lys- Leu Thr 

#65 ',^70 -75 80 

-^S ei^Jg:^le:^*Thr ^5<Ly ss^fS e r7?^;Asp]r^Asn^Ly s > -Gly .Tyr ^Al a ^ Leu - Va 1 Ty r Glu ' Thr 

:m5 90 95 

.f-*Pro^;Glu. Gly;:'Val J-VaV Ser Val t Gin Ala Lys Thr Val Val Met Thr He 

100 - 105 110 

-'Pro>Ser. Tyr Val Ala Ser Asp He Leu Arg Pro Leu Ser Ser Asp Ala 
115 120 125 

Ala Asp Ala Leu Ser He Phe Tyr Tyr Pro Pro Val Ala Ala Val Thr 
130 135 140 

Val Ser Tyr Pro Lys Glu Ala He Arg Lys Glu Cys Leu He Asp Gly 
145 150 155 160 

Glu Leu Gin Gly Phe Gly Gin Leu His Pro Arg Ser Gin Gly Val Glu 

165 170 175 

Thr Leu Gly Thr He Tyr Ser Ser Ser Leu Phe Pro Asn Arg Ala Pro 

180 185 190 

Ala Gly Arg Val Leu Leu Leu Asn Tyr He Gly Gly Ser Thr Asn Thr 
195 200 205 

Gly He Val Ser Lys Thr Glu Ser Glu Leu Val Glu Ala Val Asp Arg 
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210 215 220 

Asp Leu Arg Lys Met Leu lie Asn Pro Arg Ala Val Asp Pro Leu Val 
225 230 235 240 

Leu Gly Val Arg Val Trp Pro Gin Ala lie Pro Gin Phe Leu lie Gly 

245 250 255 

His Leu Asp His Leu Glu Ala Ala Lys Ser Ala Leu Gly Lys Gly Gly 

260 265 270 

Tyr Asp Gly Leu Phe Leu Gly Gly Asn Tyr Val Ala Gly Val Ala Leu 
275 280 285 

Gly Arg Cys Val Glu Gly Ala Tyr Glu Ser Ala Ser Gin lie Ser Asp 
290 295 300 

Tyr Leu Thr Lys Tyr Ala Tyr Lys 
305 310 



<210> 23 
<211> 1590 
<212> DNA 

<213> Sorghum bicolor 

<220> 

<221> CDS 

<222> (1) . . (1320) 

<223> sorghum protox-1 partial coding sequence 
<400> 23 

tec acc gtc gag cgc ccc gag gaa ggg tac etc tgg gag gag ggt ccc 48 
Ser Thr Val Glu Arg Pro Glu Glu Gly Tyr Leu Trp Glu Glu Gly Pro 
15 10 15 

aac age ttc cag cca tec gae ecc gtt etc tec atg gee gtg gae age 96 
Asn Ser Phe Gin Pro Ser Asp Pro Val Leu Ser Met Ala Val Asp Ser 

20 25 30 

ggg ctg aag gat gae ctg gtt ttt ggg gae ccc aac gcg cca egg ttc 144 
Gly Leu Lys Asp Asp Leu Val Phe Gly Asp Pro Asn Ala Pro Arg Phe 
35 40 45 

gtg ctg tgg gag ggg aag ctg agg ccc gtg eea tee aag ccc gee gae 192 
Val Leu Trp Glu Gly Lys Leu Arg Pro Val Pro Ser Lys Pro Ala Asp 
50 55 60 

etc ecg ttc ttc gat etc atg age ate cet ggc aag etc agg gee ggt 240 
Leu Pro Phe Phe Asp Leu Met Ser He Pro Gly Lys Leu Arg Ala Gly 
65 70 75 80 

etc ggc gcg ctt ggc ate cgc ccg cet get eea ggc cgc gag gag tea 288 
Leu Gly Ala Leu Gly He Arg Pro Pro Ala Pro Gly Arg Glu Glu Ser 

85 90 95 
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gtg gag gag ttt gtg cgc cgc aac etc ggt get gag gtc ttt gag cge 336 

Val Glu Glu Phe Val Arg Arg Asn Leu Gly Ala Glu Val Phe Glu Arg 

100 105 110 

eta att gag cct ttc tgc tea ggt gte tat get ggc gat ect tec aag 3 84 

Leu lie Glu Pro Phe Cys Ser Gly Val Tyr Ala Gly Asp Pro Ser Lys 
115 120 125 

etc agt atg aag get gca ttt ggg aag gtg tgg egg tta gaa gaa get 432 

Leu Ser Met Lys Ala Ala Phe Gly Lys Val Trp Arg Leu Glu Glu Ala 
130 135 140 

gga ggt agt att att ggt gga ace ate aag acg att cag gag agg gge 480 

Gly Gly Ser He He Gly Gly Thr He Lys Thr He Gin Glu Arg Gly 

145 150 155 160 

aag aat cea aaa eca ceg agg gat cee ege ctt ccg aag cca aaa ggg 528 

Lys Asn Pro Lys Pro Pro Arg Asp Pro Arg Leu Pro Lys Pro Lys Gly 

165 170 175 

cag aca gtt gca tet ttc agg aag ggt ctt gee atg ctt cca aat gee 576 

Gin Thr Val Ala Ser Phe Arg Lys Gly Leu Ala Met Leu Pro Asn Ala 

180 185 190 

ate;i;aca%tee age ittg ggt agt aaa , gte aaa eta tea tgg aaa etc acg 624 

"ile'-Thr Ser Ser rLeu Gly Ser Lys Val Lys Leu Ser Trp Lys Leu Thr 
1'95 "200 . 205 

^^age^4^aj:gi,.aea; aaav^:t^ aag-t^sggg- tat rgtt ttg gag tat gaa aca 672 

^Se-i?|^Me t^Thr-^^Lyslii'Ber'i^^Asp vGly Lys^^^Gly -Tyr "^Val*^Leu^;Glu' Tyr Glu -Thr 
^10 .215 2-20 

' -cca|igaa^gggKgtt45g^:t^;fttg -gtg cag;:^gct^^aaa-agt vgttvatc atg-^ace att ■ 720 

V ^Pro-Glu' Gly .Vali,iyal .Leu Val Gln,?Aaa Lys Ser Val He Met Thr He 

'"2'25 230 235 240 

cca tea tat gtt get agc.gae att ttg cgt cca ctt tea ggt gat- get 768 

.Pro Ser Tyr Val^Ala Ser Asp He . Leu Arg Pro Leu Ser Gly Asp Ala 

'245 250 ^ 255 

gca gat gtt eta tea aga ttc tat tat cca cca gtt get get gta acg 816 

Ala Asp Val Leu Ser Arg Phe Tyr Tyr Pro Pro Val Ala Ala Val Thr 

260 265 270 

gtt teg tat cca aag gaa gca att aga aaa gaa tgc tta att gat ggg 864 

Val Ser Tyr Pro Lys Glu Ala He Arg Lys Glu Cys Leu He Asp Gly 
275 280 285 

gaa etc cag ggt ttt ggc cag ttg cat cca cgt agt caa gga gtt gag 912 

Glu Leu Gin Gly Phe Gly Gin Leu His Pro Arg Ser Gin Gly Val Glu 
290 295 300 

aca tta gga aca ata tae age tea tea etc ttt cca aat cgt get cct 960 

Thr Leu Gly Thr He Tyr Ser Ser Ser Leu Phe Pro Asn Arg Ala Pro 

305 310 315 320 

get ggt agg gtg tta ctt eta aac tae ata gga ggt get aca aac aca 1008 
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Ala Gly Arg Val Leu Leu Leu Asn Tyr lie Gly Gly Ala Thr Asn Thr 

325 330 335 

gga att gtt tec aag act gaa agt gag ctg gta gaa gca gtt gac cgt 1056 
Gly lie Val Ser Lys Thr Glu Ser Glu Leu Val Glu Ala Val Asp Arg 

340 345 350 

gac etc cga aaa atg ctt ata aat cct aca gca gtg gac cct tta gtc 1104 
Asp Leu Arg Lys Met Leu lie Asn Pro Thr Ala Val Asp Pro Leu Val 
355 360 365 

ctt ggt gtc cga gtt tgg cca caa gcc ata cct cag ttc ctg gta gga 1152 
Leu Gly Val Arg Val Trp Pro Gin Ala lie Pro Gin Phe Leu Val Gly 
370 375 380 

cat ctt gat ctt ctg gag gcc gca aaa tct gcc ctg gac caa ggt ggc 1200 
His Leu Asp Leu Leu Glu Ala Ala Lys Ser Ala Leu Asp Gin Gly Gly 
385 390 395 400 

tat aat ggg ctg ttc eta gga ggg aac tat gtt gca gga gtt gee ctg 1248 
Tyr Asn Gly Leu Phe Leu Gly Gly Asn Tyr Val Ala Gly Val Ala Leu 

405 410 415 

ggc aga tgc att gag ggc gca tat gag agt gcc gcg caa ata tat gac 1296 
Gly Arg Cys lie Glu Gly Ala Tyr Glu Ser Ala Ala Gin He Tyr Asp 

420 425 430 

ttc ttg ace aag tac gcc tac aag tgatggaaga agtggagegc tgcttgttaa 1350 
Phe Leu Thr Lys Tyr Ala Tyr Lys 
435 440 

ttgttatgtt gcatagatga ggtgagaeea ggagtagtaa aaggcgteac gagtattttt 1410 

cat tct tat t ttgtaaattg cacttctgtt tttttttcet gtcagtaatt agttagattt 1470 

tagttatgta ggagattgtt gtgttcaetg cectacaaaa gaatttttat tttgcattcg 1530 

tttatgagag ctgtgcagac ttatgtaacg ttttactgta agtatcaaca aaatcaaata 1590 



<210> 24 
<211> 440 
<212> PRT 

<213> Sorghum bicolor 
<400> 24 

Ser Thr Val Glu Arg Pro Glu Glu Gly Tyr Leu Trp Glu Glu Gly Pro 
15 10 15 

Asn Ser Phe Gin Pro Ser Asp Pro Val Leu Ser Met Ala Val Asp Ser 

20 25 30 

Gly Leu Lys Asp Asp Leu Val Phe Gly Asp Pro Asn Ala Pro Arg Phe 
35 40 45 

Val Leu Trp Glu Gly Lys Leu Arg Pro Val Pro Ser Lys Pro Ala Asp 
50 55 60 
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Leu Pro Phe Phe Asp Leu Met Ser lie Pro Gly Lys Leu Arg Ala Gly 
65 70 75 80 

Leu Gly Ala Leu Gly lie Arg Pro Pro Ala Pro Gly Arg Glu Glu Ser 

85 90 95 

Val Glu Glu Phe Val Arg Arg Asn Leu Gly Ala Glu Val Phe Glu Arg 

100 105 110 

Leu lie Glu Pro Phe Cys Ser Gly Val Tyr Ala Gly Asp Pro Ser Lys 
115 120 125 

Leu Ser Met Lys Ala Ala Phe Gly Lys Val Trp Arg Leu Glu Glu Ala 
130 135 140 

Gly Gly Ser lie lie Gly Gly Thr lie Lys Thr. lie Gin Glu Arg Gly 
145 150 155 160 

Lys Asn Pro Lys Pro Pro Arg Asp Pro Arg Leu Pro Lys Pro Lys Gly 

165 170 175 

-Glm,Thr^:*Val A-la Ser Phe Arg Lys Gly Leu Ala Met Leu Pro Asn Ala 

180 185 190 

Ile'^-'Thr-'^SerwSer 'Leu Gly Ser Lys Val -Lys Leu Ser Trp Lys Leu Thr 
.1^95 200 205 

Ser^; Me t ^Thr^:Lys>^ 'Se r>f Asp Gly - Lys Gly ^ Tyr Va 1 Leu Glu Tyr Glu Thr 

...;..-2<L0 : ^^215 ^220 ' 

-;^Pro^Glm^GlyM7al-Val«&Leu -Val -Gin Ala Lys- Val" lie Met Thr He 
e225 :V230 -235 240 

^'•Pro Ser.iTyr=*-Val Ala^^ Ser ^Asp He Leu Arg Pro Leu Ser Gly Asp Ala 

245 ,250 255 

'. Ala-'Asp^'Val Leu Ser Arg Phe Tyr Tyr Pro Pro Val Ala Ala Val Thr 

250 265 270 

Val i Ser Tyr Pro Lys Glu Ala He Arg Lys Glu Cys Leu He Asp Gly 
275 280 285 

Glu Leu Gin Gly Phe Gly Gin Leu His Pro Arg Ser Gin Gly Val Glu 
290 295 300 

Thr Leu Gly Thr He Tyr Ser Ser Ser Leu Phe Pro Asn Arg Ala Pro 
305 310 315 320 

Ala Gly Arg Val Leu Leu Leu Asn Tyr He Gly Gly Ala Thr Asn Thr 

325 330 335 

Gly He Val Ser Lys Thr Glu Ser Glu Leu Val Glu Ala Val Asp Arg 

340 345 350 



Asp Leu Arg Lys Met Leu He Asn Pro Thr Ala Val Asp Pro Leu Val 
355 360 365 
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Leu Gly Val Arg Val Trp Pro Gin Ala lie Pro Gin Phe Leu Val Gly 
370 375 380 

His Leu Asp Leu Leu Glu Ala Ala Lys Ser Ala Leu Asp Gin Gly Gly 
385 390 395 400 

Tyr Asn Gly Leu Phe Leu Gly Gly Asn Tyr Val Ala Gly Val Ala Leu 

405 410 415 

Gly Arg Cys He Glu Gly Ala Tyr Glu Ser Ala Ala Gin He Tyr Asp 

420 425 430 

Phe Leu Thr Lys Tyr Ala Tyr Lys 
435 440 



<210> 25 
<211> 93 
<212> DNA 
<213> 2ea mays 

<220> 

<221> misc_f eature 
<222> (1)..(93) 

<223> maize protox-1 intron sequence 
<400> 25 

gtacgctcct cgctggcgcc gcagcgtctt cttctcagac tcatgcgcag ccatggaatt 60 
gagatgctga atggatttta tacgcgcgcg cag 93 

<210> 26 
<211> 2606 
<212> DNA 

<213> Beta vulgaris 
<220> 

<221> inisc_f eature 
<222> (2601) (2606) 
<223> Sail site 

<220> 

<221> misc_.f eature 
<222> (1) . . (538) 

<223> partial c-DNA of sugar beet protox-1 
<220> 

<221> inisc__f eature 
<222> (539) (2606) 

<223> sugar beet protox-1 promoter region (partial 

sequence of the -Skb Pstl-Sall fragment subcloned 
from pWDC-20) 

<400> 26 

ctgcaggggg agggaaagag agaccgcgac ggtgagggag gggagaccgc gacggtgagg 60 
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gaggggagaa cgcgacggtg agggagggga gaacgcgatg gtgagggagg ggagaacgcg 120 
acgcgcaggg gagggggata actcgacggt gcagggaggt gagggggacg acgtgacggc 180 
gcaggggagg ggggaaccgt cgcgggaagg ggaagaccgg ggggccgaca aggtggtgtt 240 
actggggtag ggagaggcgg cgtggagaat agtaacagag ggaggagtgg tggtgctagg 300 
gtggaagaag ggtaagaaag aggaagaaag agaattaaca ttatcttaac caaacaccac 360 
tctaaatcta agggttttct tttcctttcc tctcctctcc ctttcttgat tccattccct 420 
ttaccccgtt gcaaccaaac gcccccttat tatggaccgg aggaagtatg tagagatggt 480 
cacaaaacta cttaagctgg taacttataa atatactggg tattaaatga attaagtggc 540 
cacaaaatga ctataaatta cttcgtaatc tttaggaact atgttggtca cgaaataaca 600 
taaaactggt tatttaatgg ctttatgtag gtactgcatt cataaatata tttctaacat 660 
aatcgtggta tgtaggtgtt ttataacaca aggattaggt ttacaccaat gtcattttca 720 
ttagaatgta gttagaatca ctttggaact ttgaagagtg atgacacatt tttattatgc 780 
ttttatgaaa tgtctttgtg gtttttatga tagtattgag tttaaggcaa gttggaagta 840 
tatgatggag aagtacagta tataggtgac aattggtttg cttgtttcta tgagttgaaa 900 
gataagtagt acacgacact gagcaatgac ctcttcttag ttgtaatttt gtcttctcga 960 
cgtagtgaaa gtacaaacaa gattatggct ttcaagcttc caagataacg agattgtatg 1020 
aattttgtgg tgtatttcac atcattgttt tacgttggag acaaactaaa accaatgatg 1080 
agtttgtgga ttcgagattt gcccctaagt cttatttacc catggcaagc atgctgaaac 1140 
atgttagtca aacttacaca gctacaatgt ttagggattt tgagcaaaaa atttgggtat 1200 
tctttgggta ccattatgtg agttgttgac tatggattaa acaaaatcac tatataaagt 1260 
ctggaatgag aagcafeccgc aattgacaca ccatgttact ttgattgttt caacaagttt 1320 
attagatgta -tttgtaggaa v-ttttgaagag gcggagatgt tgtgttataa ttgctttggg 1380 
'ggtgcttGac*a,fegcaotGttg ttagtgagac atcttcagct tatattttaa ggcggttagt 1440 
gagtatgatt tttttttttc'-aaacttttcg atttcdatgt aattaaaaaa .ggtgtttgat 1500 
, aaatacatgfe^>,«taagafeagcc,>,aagaaaaggc s.aacttteaaa caaataaaaa aaattaagtc 1560 
gcttaateat'"t'fe%tcGaa^gt*raGfet^^^ tttaaGacca cttattactg aatctatagc 1620 

cgt t aagaa t^gGa btt :tGac.:>gG.tGa,fcaGa t -gcaaa t caag aacG tec tGa ^ 1 1 gaaggaga 1680 
taatttagtc ctGa^taaaGC'-^Gcgttraaaga catttttagG atccagagaa atttGgattG 1740 
agt'taaaat t^'gcat ata feaarSGcagagaaac vaaa t tcaga t : g 1 1 a gt ca'gt ccagc ta.Ga t 1800 
•aggtcaatgc^^^cfegagagttt^i'aaaat^^ gtattgaggt 1860 

,gagittaGaaa!^,^gg.taag;t,tac/:^iCg^^ caaacaagta tggttagaag 1920 

ataGatgtaa'^'fe^:g.ttta^^tt*^agag - GfcttttaaGt ragaaaGagtt 1980 

■-gtt tGatfet t^gatA^feaaggt;t.fe^ gGaagaaggg ga tatagagg 2040 

ataaggagggKiGg^gagagGg^.^agagaga^ gcgttatcaa, aGtgcattcc 2100 

.acagaGaeagi^-gGa.tgGGat.;4:gGaeagGag:.cgggGat aggggcaatt-^ gtatcatgtt 2160 
gtcaattGca'Lfcgtagtttaa^ ttggaagacg naggttattat tcacataaga agaggaggat 2220 
gagGatgagt tgcagGacaa gctcaggGtc aaagtcagcg gttaaagaag GaggatGagg 2280 
atcaggatGa ggageaggag gattgctaga ctgcgtaatc gttggaggtg gaattagcgg 2340 
gctttgcatc gcgcaggetG- tttgtacaaa. acagtcctct ttatcGccaa attttatagt 2400 
gacagaggGG .aaagacagag ttggcggcaa catcgtGact gtggaggGGg atggGtatat 2460 
ctgggaggag ggaGGcaata gGttGGagcG ttCGgaGgcg gtgGtcacGa tggcggtaat 2520 
tctgtctGtt Gattattcat aatcataatt caattcaatt caattGctaa Ggtggaatgt 2580 
ggaatgtggc atgtgcgtag gtGgaG 2606 



<210> 27 

<211> 31 

<212> DNA 

<213> ArtifiGial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Pclp_Pla - 

plastid clpP gene promoter top strand PGR primer 

<220> 

<221> misc_feature 
<222> (4) . . (9) 
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<223> EcoRI restriction site 
<400> 27 

gcggaattca tacttattta tcattagaaa g 

<210> 28 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Pclp_Plb 
plastid clpP gene promoter bottom strand PGR 
primer 

<220> 

<221> misc_feature 

<222> (4) . . (9) 

<223> Xbal restriction site 

<400> 28 

gcgtctagaa agaactaaat actatatttc ac 

<210> 29 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Pclp__P2b 
plastid clpP gene promoter bottom strand PGR 
primer 

<220> 

<221> mis cofeature 

<222> (4) . . (9) 

<223> Wcol restriction site 

<400> 29 

gcgccatggt aaatgaaaga aagaactaaa 

<210> 30 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Trpsl6_Pl 
plastid rpslG gene 3' -untranslated region 
Xbal/Hindlll top strand PGR primer 

<220> 

<221> misc_feature 
<222> (4) . . (9) 
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<223> Xbal restriction site 



<400> 30 

gcgtctagat caaccgaaat tcaattaagg 



30 



<210> 31 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Trpsl6_plb - 
plastid rpsl6 gene 3' untranslated region 
Xbal/Hindlll bottom strand PGR 
primer 

<220> 

<221> misc„feature 
<222> (4).. (9) 

<223> •Hiridlll restriction site , ... 

<400> 31 . . , , , . . , 

cgcaagettc aatggaagca -atgataa - .. . 27 



<210> 33 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: minpsb_L - 
plastid psbA gene 5' vintranslated region 38 nt 
(blunt /Ncol) including ATG start codon (bottom 
strand primer) 

<400> 33 

catggtaaaa tcttggttta tttaatcatc agggactccc 40 



y<210>'32 
ti<211>.'.36 
•?<212>..'»?DNA 



^^2 16 >-*3lrM^f-d^eiai'?'*Sequen^ e 




^^400^^32 . , . - . , 

-.-igggagtccct gatgattaaa taaaccaaga ttttac 



36 



<210> 34 
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<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: APRTXPla - top 
strand PCR primer for amplifying the 5' portion of 
the mutant Arabidopsis protox gene 

<220> 

<221> misc_feature 
<222> (5) . . (10) 

<223> Ncol restriction site/ATG start codon 
<400> 34 

gggaccatgg attgtgtgat tgtcggcgga gg 32 



<210> 35 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: APRTXPlb - 
bottom strand PCR primer for amplifying the 5' 
portion of the miutant Arabidopsis protox gene 

<400> 35 

ctccgctctc cagcttagtg atac 24 



<210> 36 
<211> 633 
<212> DNA 
<213> sugar cane 

<22 0> 

<221> CDS 

•<222> (3) . , (305) 

<223> sugar cane protox- 1 partial coding sequence 
<400> 36 

tt tec aag act gaa agt gag ctg gta gaa gca gtt gac cgt gac etc 47 
Ser Lys Thr Glu Ser Glu Leu Val Glu Ala Val Asp Arg Asp Leu 
15 10 15 

egg aaa atg ctt ata aat cct aca gca gtg gac cct tta gtc ctt ggt 95 
Arg Lys Met Leu lie Asn Pro Thr Ala Val Asp Pro Leu Val Leu Gly 

20 25 30 

gtc cga gtt tgg cca caa gcc ata cct cag ttc ctg gta gga cat ctt 143 
Val Arg Val Trp Pro Gin Ala lie Pro Gin Phe Leu Val Gly His Leu 

35 40 45 

gat ctt ctg gag gcc gca aaa tct gcc ctg gac cga ggt ggc tac gat 191 
Asp Leu Leu Glu Ala Ala Lys Ser Ala Leu Asp Arg Gly Gly Tyr Asp 
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50 55 60 

ggg ctg ttc eta gga ggg aac tat gtt gca gga gtt gcc eta ggc aga 239 
Gly Leu Phe Leu Gly Gly Asn Tyr Val Ala Gly Val Ala Leu Gly Arg 
65 70 75 

tgc gtt gag ggc gcg tat gag agt gcc teg caa ata tat gac ttc ttg 287 
Cys Val Glu Gly Ala Tyr Glu Ser Ala Ser Gin He Tyr Asp Phe Leu 
80 85 90 95 

acc aag tat gcc tac aag tgatgaaaga agtggagtgc tgcttgttaa 335 
Thr Lys Tyr Ala Tyr Lys 

100 

ttgttatgtt gcatagatga ggtgagacca ggagtagtaa aagcgttacg agtatttttc 395 
attcttattt tgtaaattgc acttctggtt ttttcctgtc agtaattagt tagattttag 455 
ttctgtagga gattgttctg ttcactgccc tacaaaagaa tttttatttt geattcgttt 515 

'atgagagctg^ tgcagactta tgtagcgttt ttctgtaagt accaacaaaa tcaaatacta 575 ^ - 

*;ttc:tgtaaga gctaacagaa tgtgcaactg agattgcctt ggatgaaaaa -aaaaaaaa 633 

^2'10>^37r., - . _ .. , 

^2^:0)1 . , ........ 

?^2a?2>^'^PRT . . , . ^ - 

*';^<-21^3>;?^sugar^Gane . , . . . ■ , - v^.-^ . 

-^Se2^^!Lys^^ttl^ Ala -Val. Asp ■ Arg Asp> Leu .Arg \ ^ ; : : 

■^fLys^et LeuiUIle^sri??Prov^ Asp Pro Leu Val Leu-^GlyVal * ^ ... > v . -^.i .- . 

-20 25 30 

-^ArgWar Trp Pro Gin Ala He Pro. Gin Phe Leu Val Gly His Leu Asp - . 

. "35 .40 45 

v^Leu Leu Glu Ala Ala Lys Ser Ala Leu Asp Arg Gly Gly Tyr Asp Gly 
50 55 60 

Leu Phe Leu Gly Gly Asn Tyr Val Ala Gly Val Ala Leu Gly Arg Cys 
65 70 75 80 

Val Glu Gly Ala Tyr Glu Ser Ala Ser Gin He Tyr Asp Phe Leu Thr 

85 90 95 

Lys Tyr Ala Tyr Lys 

100 



<210> 38 
<211> 4 
<212> PRT 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: siib-sequence 1 

<400> 38 
Ala Pro Xaa Phe 
1 



<210> 39 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: sub-sequence 8 
<400> 39 

Tyr lie Gly Gly Xaa 
1 5 



<210> 40 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: sub-sequence 
12 

<400> 40 
lie Gly Gly Xaa 
1 



<210> 41 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: sub-sequence 
13 

<400> 41 

Ser Trp Xaa Leu Xaa 
1 5 



<210> 42 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence: svib-sequence 
15 

<400> 42 

Gly Xaa Xaa Gly Leu 
1 5 



<210> 43 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: sub-sequence 
17 

<400> 43 
Tyr Val Xaa Gly 
1 



<210> 44 
<2ia>- .1503 
<212>. DNA 
<213> Zea mays 

<220> 
<2'21>vGDS 
.^:2'22>^*'';{1) .i."(=.1^503) 

^^223>;';cpding'¥sequence" -for avmature^maize protox-1 protein 
<400>?544 

gctwgctT^g.tg^,gcg.;,ggc ggc -^gcg- gcc gag-^ gca tec acc ggc gcg - 48 * 

Ala- Ala^^Val ^Ala Gly Gly-rAla- Ala Glu Ala Pro Ala Ser Thr . Gly Ala 
1 5 10 15 

egg ctg- tec gcg gac tgc gtc gtg gtg ggc gga ggc ate agt ggc etc 96 
Arg Leu Ser Ala Asp Cys . Val Val Val Gly Gly Gly He Ser Gly Leu 

20 25 30 

tgc acc gcg cag gcg ctg gee acg egg cac ggc gtc ggg gac gtg ctt 144 
Cys Thr Ala Gin Ala Leu Ala Thr Arg- His Gly Val Gly Asp Val Leu 
35 40 45 

gtc acg gag gee cgc gcc cgc ccc ggc ggc aac att acc acc gtc gag 192 
Val Thr Glu Ala Arg Ala Arg Pro Gly Gly Asn He Thr Thr Val Glu 
50 55 60 

cgc ccc gag gaa ggg tac etc tgg gag gag ggt ccc aac age ttc cag 240 
Arg Pro Glu Glu Gly Tyr Leu Trp Glu Glu Gly Pro Asn Ser Phe Gin 
65 70 75 80 

ccc tec gac cee gtt etc acc atg gee gtg gac age gga etg aag gat 288 
Pro Ser Asp Pro Val Leu Thr Met Ala Val Asp Ser Gly Leu Lys Asp 

85 90 95 
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gac ttg gtt 
Asp Leu Val 



ggg aag ctg 
Gly Lys Leu 
115 

gat etc atg 
Asp Leu Met 
13 0 

ggc ate cgc 
Gly lie Arg 
145 

gtg cgc cgc 
Val Arg Arg 



ttc tgc tea 
Phe Cys Ser 



get gca ttt 
Ala Ala Phe 
195 

att ggt gga 
lie Gly Gly 
210 

cca ccg agg 
Pro Pro Arg 
225 

tct ttc agg 
Ser Pile Arg 



ttg ggt agt 
Leu Gly Ser 



tea gat gac 
Ser Asp Asp 
275 

gtt teg gtg 
Val Ser Val 
290 

get age aac 
Ala Ser Asn 
305 

tea aga ttc 



ttt ggg gac 
Phe Gly Asp 
100 

agg cec gtg 
Arg Pro Val 



age ate cca 
Ser lie Pro 



ccg ect cet 
Pro Pro Pro 
150 

aac etc ggt 
Asn Leu Gly 
165 

ggt gtc tat 
Gly Val Tyr 
180 

ggg aag gtt 
Gly Lys Val 



ace ate aag 
Thr He Lys 



gat gee cgc 
Asp Ala Arg 
230 

aag ggt ett 
Lys Gly Leu 
245 

aaa gtc aaa 
Lys Val Lys 
260 

aag gga tat 
Lys Gly Tyr 



cag get aaa 
Gin Ala Lys 



att ttg cgt 
He Leu Arg 
310 

tat tat cca 



cca aac geg 
Pro Asn Ala 
105 

eca tec aag 
Pro Ser Lys 
120 

ggg aag etc 
Gly Lys Leu 
135 

cca ggc cgc 
Pro Gly Arg 



get gag gte 
Ala Glu Val 



get ggt gat 
Ala Gly Asp 
185 

tgg egg ttg 
Trp Arg Leu 
200 

aca att cag 
Thr He Gin 
215 

ett ccg aag 
Leu Pro Lys 



gee atg ett 
Ala Met Leu 



eta tea tgg 
Leu Ser Trp 
2 65 

gtt ttg gag 
Val Leu Glu 
280 

agt gtt ate 
Ser Val He 
295 

cca ett tea 
Pro Leu Ser 



ccg gtt get 



ceg egt ttc 
Pro Arg Phe 



cec gee gac 
Pro Ala Asp 



agg gee ggt 
Arg Ala Gly 
140 

gaa gag tea 
Glu Glu Ser 
155 

ttt gag cgc 
Phe Glu Arg 
170 

cet tct aag 
Pro Ser Lys 



gaa gaa act 
Glu Glu Thr 



gag agg age 
Glu Arg Ser 
220 

cca aaa ggg 
Pro Lys Gly 
235 

cca aat gee 
Pro Asn Ala 
250 

aaa etc aeg 
Lys Leu Thr 



tat gaa acg 
Tyr Glu Thr 



atg act att 
Met Thr He 
300 

age gat get 
Ser Asp Ala 
315 

get gta act 



gtg ctg tgg 
Val Leu Trp 
110 

etc ceg ttc 
Leu Pro Phe 
125 

eta ggc geg 
Leu Gly Ala 



gtg gag gag 
Val Glu Glu 



etc att gag 
Leu He Glu 
175 

etc age atg 
Leu Ser Met 
190 

gga ggt agt 
Gly Gly Ser 
205 

aag aat cca 
Lys Asn Pro 



cag aca gtt 
Gin Thr Val 



att aca tee 
He Thr Ser 
255 

age att aca 
Ser He Thr 
270 

cca gaa ggg 
Pro Glu Gly 
285 

cca tea tat 
Pro Ser Tyr 



gea gat get 
Ala Asp Ala 



gtt teg tat 



gag 336 
Glu 



ttc 384 
Phe 



ett 432 
Leu 



ttc 480 

Phe 

160 

cet 528 
Pro 



aag 576 
Lys 

att 624 
He 



aaa 672 
Lys 



gea 720 

Ala 

240 

age 768 
Ser 



aaa 816 
Lys 



gtt 864 
Val 



gtt 912 
Val 



eta 960 

Leu 

320 

cca 1008 
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Ser Arg Phe Tyr Tyr Pro Pro Val Ala Ala Val Thr Val Ser Tyr Pro 

325 330 335 

aag gaa gca att aga aaa gaa tgc tta att gat ggg gaa etc cag ggc 1056 
Lys Glu Ala lie Arg Lys Glu Cys Leu lie Asp Gly Glu Leu Gin Gly 

340 345 350 

ttt ggc cag ttg cat oca cgt agt caa gga gtt gag aca tta gga aca 1104 
Phe Gly Gin Leu His Pro Arg Ser Gin Gly Val Glu Thr Leu Gly Thr 
355 360 365 

ata tac agt tec tea etc ttt cca aat cgt get ect gac ggt agg gtg 1152 
lie Tyr Ser Ser Ser Leu Phe Pro Asn Arg Ala Pro Asp Gly Arg Val 
370 375 380 

tta ctt eta aac tac ata gga ggt get aca aac aca gga att gtt tec 1200 
Leu Leu Leu Asn Tyr He Gly Gly Ala Thr Asn Thr Gly He Val Ser 
385 390 395 400 

aag- act gaa agt gag ctg gtc gaa gca gtt gac cgt gae etc cga aaa 1248 
>Lys^Thr Glu -Ser Glu Leu Val Glu Ala Val Asp Arg Asp Leu Arg Lys 

405 - 410 415 

'^atg>ctt ata^ aat tet^-aca gea gtg gac ect tta gtc ctt ggt gtt cga 1296 
Met-Leu'- Ilev Asn 'Ser Thr Ala Val Asp Pro Leu Val Leu Gly Val Arg 

420 425 430 

^ gttT-tgg^-cca caa gee ata cet cag ttc ctg gta gga cat ctt gat ctt 1344 
':^:yarvTrp -Pro;*.Gln-Ala :-Ile Pro -Gin Phe Leu Val -Gly His Leu Asp Leu 
'4-35 440 ' 445 

^etg^gaa^^geci gca aaa get ■ gee ctg gae ega.ggt ggc tac gat ggg. ctg 1392 
-cLeuV Glu: Ala^Ala- Lys Ala . Ala Leu Asp Arg Gly Gly Tyr Asp Gly Leu 
^T^450 -455 - , . . . . , 

ttc^eta . gga:.ggg ^aac tat gtt gea gga gtt gee ctg ggc aga tgc gtt 1440 
^fPhest^Leu Gly -Gly. Asn Tyr Val Ala Gly Val Ala Leu Gly Arg Cys Val 
r^M5 470 • 475 ' 480 

tgagj^gge geg tat gaa agt gee teg caa ata tct gac ttc ttg acc aag 1488 
Glu^ Gly Ala Tyr Glu Ser Ala Ser Gin He Ser Asp Phe Leu Thr Lys 

485 490 495 

tat gee tac aag tga 1503 
Tyr Ala Tyr Lys 

500 



<210> 45 
<211> 500 
<212> PRT . 
<213> Zea mays 

<400> 45 

Ala Ala Val Ala Gly Gly Ala Ala Glu Ala Pro Ala Ser Thr Gly Ala 

1 5 10 . 15 

Arg Leu Ser Ala Asp Cys Val Val Val Gly Gly Gly He Ser Gly Leu 
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20 25 30 

Cys Thr Ala Gin Ala Leu Ala Thr Arg His Gly Val Gly Asp Val Leu 

35 40 45 

Val Thr Glu Ala Arg Ala Arg Pro Gly Gly Asn lie Thr Thr Val Glu 

50 55 60 

Arg Pro Glu Glu Gly Tyr Leu Trp Glu Glu Gly Pro Asn Ser Phe Gin 
65 70 75 80 

pro Ser Asp Pro Val Leu Thr Met Ala Val Asp Ser Gly Leu Lys Asp 

85 90 95 

Asp Leu Val Phe Gly Asp Pro Asn Ala Pro Arg Phe Val Leu Trp Glu 

100 105 110 

Gly Lys Leu Arg Pro Val Pro Ser Lys Pro Ala Asp Leu Pro Phe Phe 

115 120 125 

Asp Leu Met Ser lie Pro Gly Lys Leu Arg Ala Gly Leu Gly Ala Leu 

130 135 140 

Gly He Arg Pro Pro Pro Pro Gly Arg Glu Glu Ser Val Glu Glu Phe 
145 150 155 160 

Val Arg Arg Asn Leu Gly Ala Glu Val Phe Glu Arg Leu He Glu Pro 

165 170 175 

Phe Cys Ser Gly Val Tyr Ala Gly Asp Pro Ser Lys Leu Ser Met Lys 

180 185. 190 

Ala Ala Phe Gly Lys Val Trp Arg Leu Glu Glu Thr Gly Gly Ser He 

195 200 205 

He Gly Gly Thr He Lys Thr He Gin Glu Arg Ser Lys Asn Pro Lys 

210 215 220 

Pro Pro Arg Asp Ala Arg Leu Pro Lys Pro Lys Gly Gin Thr Val Ala 
225 230 235 240 

Ser Phe Arg Lys Gly Leu Ala Met Leu Pro Asn Ala He Thr Ser Ser 

245 250 255 

Leu Gly Ser Lys Val Lys Leu Ser Trp Lys Leu Thr Ser He Thr Lys 

260 265 270 

Ser Asp Asp Lys Gly Tyr Val Leu Glu Tyr Glu Thr Pro Glu Gly Val 

275 280 285 

Val Ser Val Gin Ala Lys Ser Val He Met Thr He Pro Ser Tyr Val 

290 295 300 

Ala Ser Asn He Leu Arg Pro Leu Ser Ser Asp Ala Ala Asp Ala Leu 
305 310 315 320 

Ser Arg Phe Tyr Tyr Pro Pro Val Ala Ala Val Thr Val Ser Tyr Pro 

325 330 335 

Lys Glu Ala He Arg Lys Glu Cys Leu He Asp Gly Glu Leu Gin Gly 

340 345 350 

Phe Gly Gin Leu His Pro Arg Ser Gin Gly Val Glu Thr Leu Gly Thr 

355 360 365 

lie Tyr Ser Ser Ser Leu Phe Pro Asn Arg Ala Pro Asp Gly Arg Val 

370 375 380 

Leu Leu Leu Asn Tyr He Gly Gly Ala Thr Asn Thr Gly He Val Ser 
385 390 395 400 

Lys Thr Glu Ser Glu Leu Val Glu Ala Val Asp Arg Asp Leu Arg Lys 

405 410 415 

Met Leu He Asn Ser Thr Ala Val Asp Pro Leu Val Leu Gly Val Arg 

420 425 430 

Val Trp Pro Gin Ala He Pro Gin Phe Leu Val Gly His Leu Asp Leu 

435 440 445 

Leu Glu Ala Ala Lys Ala Ala Leu Asp Arg Gly Gly Tyr Asp Gly Leu 

450 455 460 

Phe Leu Gly Gly Asn Tyr Val Ala Gly Val Ala Leu Gly Arg Cys Val 
465 470 475 480 
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Glu Gly Ala Tyr Glu Ser Ala Ser Gin lie Ser Asp Phe Leu Thr Lys 

485 490 495 

Tyr Ala Tyr Lys 

500 



<210> 46 
<211> 1608 
<212> DNA 
<213> Zea mays 

<220> 

<221> CDS 

<222> (1)..(1608) 

<223> predicted coding sequence for a maize protox-1 polypeptide precursor 
<400> 46 

atg gtc gcc gcc aca gcc acc gcc atg gcc acc get gca teg ccg eta 48 
Met Val Ala Ala Thr Ala Thr Ala Met Ala Thr Ala Ala Ser Pro Leu 
1 5 10 15 

cto aaci^ggg ace cga /ata ect geg egg etc cgc cat cga gga etc age 96 
Leu^.'Asn Gly Thr Arg .lie Pro Ala Arg Leu Arg His Arg Gly Leu Ser 

20 25 30 

gtg cge tgc get get gtg gcg ggc ggc gcg gcc gag gca ccg gea tee 144 
?.i.ValJ^>ArglCys*^Ala Ala-Val Ala .Gly . Gly Ala Ala Glu Ala Pro Ala Ser 
-35 40 45 

>-'accc*.ggci^gcg.vicgg' ctg/ tee. gcgr^ tge gtc gtg gtg, ggc gga ggc ate 192 
:;t?rhr^.Glyi*^Ala^Arg Val Val, Gly Gly Gly He 

50- 55 -60 

Aagt.teggc'',:etc tgc-.acc :*gcg eag gcg-ctg.gee aeg egg cae.gge gtc ggg 240 
^tiSernGly^^LeutGys Thr'^Ala^'Gln Ala Leu Ala Thr Arg His Gly Val Gly 
^'65 "70 .75 80 

^vgaqrgtg ^ctt - gte-acg gag gee cgc, gcc cgc ecc ggc gge aac att ace 288 
^Asp'..Val-- Leu Val Thr Glu Ala Arg Ala Arg Pro Gly Gly Asn He Thr 

85 90 95 

acc gtc gag cgc ccc gag gaa ggg tac etc tgg gag gag ggt ccc aac 336 
Thr Val Glu Arg Pro Glu Glu Gly Tyr Leu Trp Glu Glu Gly Pro Asn 

100 105 110 

age ttc eag cee tec gac ccc gtt etc ace atg gcc gtg gae age gga 384 
Ser Phe Gin Pro Ser Asp Pro Val Leu Thr Met Ala Val Asp Ser Gly 
115 120 125 

ctg aag gat gac ttg gtt ttt ggg gae cea aac gcg ecg cgt ttc gtg 432 
Leu Lys Asp Asp Leu Val Phe Gly Asp Pro Asn Ala Pro Arg Phe Val 
130 135 140 

ctg tgg gag ggg aag ctg agg ccc gtg cea tec aag ecc gcc gae etc 480 
Leu Trp Glu Gly Lys Leu Arg Pro Val Pro Ser Lys Pro Ala Asp Leu 
145 150 155 160 
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ccg ttc ttc gat etc atg age ate cca ggg aag etc agg gee ggt eta 
Pro Phe Phe Asp Leu Met Ser He Pro Gly Lys Leu Arg Ala Gly Leu 

165 170 175 



528 



ggc geg ett ggc ate cgc ccg ect cct cca gge ege gaa gag tea gtg 576 
Gly Ala Leu Gly He Arg Pro Pro Pro Pro Gly Arg Glu Glu Ser Val 

180 185 190 



gag gag ttc gtg cgc cgc aac etc ggt get gag gtc ttt gag cgc etc 
Glu Glu Phe Val Arg Arg Asn Leu Gly Ala Glu Val Phe Glu Arg Leu 
195 200 205 

att gag cct ttc tgc tea ggt gtc tat get ggt gat cct tct aag etc 
He Glu Pro Phe Cys Ser Gly Val Tyr Ala Gly Asp Pro Ser Lys Leu 
210 215 220 



ggt agt att. att ggt gga acc ate aag aca att cag gag agg age aag 
Gly Ser He He Gly Gly Thr He Lys Thr He Gin Glu Arg Ser Lys 

245 250 255 



624 



672 



age atg aag get gea ttt ggg aag gtt tgg egg ttg gaa gaa act gga 720 
Ser Met Lys Ala Ala Phe Gly Lys Val Trp Arg Leu Glu Glu Thr Gly 
225 230 235 240 



768 



aat cca aaa cca ccg agg gat gee cgc ett ccg aag cca aaa ggg cag 816 
Asn Pro Lys Pro Pro Arg Asp Ala Arg Leu Pro Lys Pro Lys Gly Gin 

260 265 , 270 

aca gtt gea tct ttc agg aag ggt ett gee atg ett cca aat gee att 864 
Thr Val Ala Ser Phe Arg Lys Gly Leu Ala Met Leu Pro Asn Ala He 
275 280 285 

aca tec age ttg ggt agt aaa gtc aaa eta tea tgg aaa etc acg age 912 
Thr Ser Ser Leu Gly Ser Lys Val Lys Leu Ser Trp Lys Leu. Thr Ser 
290 295 300 

att aca aaa tea gat gae aag gga tat gtt ttg gag tat gaa acg cca 960 
He Thr Lys Ser Asp Asp Lys Gly Tyr Val Leu Glu Tyr Glu Thr Pro 
305 310 315 320 

gaa ggg gtt gtt teg gtg cag get aaa agt gtt ate atg act att cca 1008 
Glu Gly Val Val Ser Val Gin Ala Lys Ser Val He Met Thr He Pro 

325 330 335 

tea tat gtt get age aac att ttg egt cca ett tea age gat get gea 1056 
Ser Tyr Val Ala Ser Asn He Leu Arg Pro Leu Ser Ser Asp Ala Ala 

340 345 350 

gat get eta tea aga ttc tat tat cca ccg gtt get get gta act gtt 1104 
Asp Ala Leu Ser Arg Phe Tyr Tyr Pro Pro Val Ala Ala Val Thr Val 
355 360 365 

teg tat cca aag gaa gea att aga aaa gaa tgc tta att gat ggg gaa 1152 
Ser Tyr Pro Lys Glu Ala He Arg Lys Glu Cys Leu He Asp Gly Glu 
370 375 380 



etc cag ggc ttt ggc cag ttg cat cca egt agt eaa gga gtt gag aca 



1200 
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Leu Gin Gly Phe Gly Gin Leu His Pro Arg Ser Gin Gly Val Glu Thr 
385 390 395 400 

tta gga aca ata tac agt tec tea etc ttt cca aat cgt get cet gae 1248 
Leu Gly Thr He Tyr Ser Ser Ser Leu Phe Pro Asn Arg Ala Pro Asp 

405 410 415 

ggt agg gtg tta ett eta aac tac ata gga ggt get aca aac aea gga 1296 
Gly Arg Val Leu Leu Leu Asn Tyr He Gly Gly Ala Thr Asn Thr Gly 

420 425 430 

att gtt tec aag act gaa agt gag ctg gtc gaa gea gtt gae cgt gae 1344 
He Val Ser Lys Thr Glu Ser Glu Leu Val Glu Ala Val Asp Arg Asp 
435 440 445 

etc cga aaa atg ett ata aat tet aca gea gtg gae ect tta gtc ett 1392 
Leu Arg Lys Met Leu He Asn Ser Thr Ala Val Asp Pro Leu Val Leu 
450 455 460 

ggt gtt cga gtt tgg cca caa gee ata cet cag ttc ctg gta gga cat 1440 
Gly ^Val Arg Val Trp Pro Gin Ala He Pro Gin Phe Leu Val Gly His 
^465 ^ 470 475 480 

ett gat ett etg gaa gee gea aaa get gee ctg gae cga ggt ggc tac 1488 
Leu Asp Leu Leu Glu Ala Ala Lys Ala Ala Leu Asp Arg Gly Gly Tyr 

485 490 495 

gat- ggg ctg ttc eta gga ggg aac tat gtt gea gga gtt gee etg ggc 1536 
-Asp^Gly Leu Phe ^ Leu Gly Gly Asn Tyr Val Ala Gly Val Ala Leu Gly 

500 .505 510 

-.aga ^tge gtt gag:t,ggc-gcg tat, gaa agt gee teg caa ata tet gae tte 1584 
.Arg.Cys Val Glu. Gly Ala Tyr .Glu. Ser Ala Ser Gin He Ser Asp Phe 
515 520 525 

ttg^aec aag tat gee tac aag. tga 1608 
.:Leu vThr liys_Tyr.. Ala^ Tyr Lys 
■530 " 535 



<210> 47 
<211> 535 
<212> PRT 
<213> Zea mays 

<400> 47 

Met Val Ala Ala Thr Ala 

1 5 
Leu Asn Gly Thr Arg He 

20 

Val Arg Cys Ala Ala Val 
35 

Thr Gly Ala Arg Leu Ser 
50 

Ser Gly Leu Cys Thr Ala 
65 70 
Asp Val Leu Val Thr Glu 



Thr Ala Met Ala Thr 

10 

Pro Ala Arg Leu Arg 
25 

Ala Gly Gly Ala Ala 
40 

Ala Asp Cys Val Val 
55 

Gin Ala Leu Ala Thr 

75 

Ala Arg Ala Arg Pro 



Ala 


Ala 


Ser 


Pro 


Leu 








15 




His 


Arg 


Gly 


Leu 


Ser 






30 






Glu 


Ala 


Pro 


Ala 


Ser 




45 








Val 


Gly 


Gly 


Gly 


He 


60 










Arg 


His 


Gly 


Val 


Gly 










80 


Gly 


Gly 


Asn 


He 


Thr 
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Thr Val Glu Arg 

100 

Ser Phe Gin Pro 
115 

Leu Lys Asp Asp 
130 

Leu Trp Glu Gly 
145 

Pro Phe Phe Asp 



Gly 

Glu 

He 

Ser 
225 
Gly 



Ala Leu Gly 
180 

Glu Phe Val 

195 
Glu Pro Phe 
210 

Met Lys Ala 
Ser He He 



Asn Pro Lys Pro 

260 

Thr Val Ala Ser 
275 

Thr Ser Ser Leu 
290 

He Thr Lys Ser 
305 

Glu Gly Val Val 



Ser 

Asp 

Ser 

Leu 
385 
Leu 

Gly 

He 

Leu 

Gly 
465 
Leu 



Tyr Val Ala 
340 

Ala Leu Ser 

355 
Tyr Pro Lys 
370 

Gin Gly Phe 
Gly Thr He 

Arg Val Leu 
420 

Val Ser Lys 

435 
Arg Lys Met 
450 

Val Arg Val 
Asp Leu Leu 



Asp Gly Leu Phe 

500 

Arg Cys Val Glu 
515 

Leu Thr Lys Tyr 
530 



85 

Pro Glu Glu Gly Tyr 

105 

Ser Asp Pro Val Leu 

120 

Leu Val Phe Gly Asp 
135 

Lys Leu Arg Pro Val 
150 

Leu Met Ser He Pro 
165 

He Arg Pro Pro Pro 

185 

Arg Arg Asn Leu Gly 

200 

Cys Ser Gly Val Tyr 
215 

Ala Phe Gly Lys Val 
230 

Gly Gly Thr He Lys 
245 

Pro Arg Asp Ala Arg 

265 

Phe Arg Lys Gly Leu 

280 

Gly Ser Lys Val Lys 
295 

Asp Asp Lys Gly Tyr 
310 

Ser Val Gin Ala Lys 
325 

Ser Asn He Leu Arg 

345 

Arg Phe Tyr Tyr Pro 

360 

Glu Ala He Arg Lys 
375 

Gly Gin Leu His Pro 
390 

Tyr Ser Ser Ser Leu 
405 

Leu Leu Asn Tyr He 

425 

Thr Glu Ser Glu Leu 

440 

Leu He Asn Ser Thr 
455 

Trp Pro Gin Ala He 
470 

Glu Ala Ala Lys Ala 
485 

Leu Gly Gly Asn Tyr 

505 

Gly Ala Tyr Glu Ser 

520 

Ala Tyr Lys 
535 



90 

Leu Trp Glu Glu 

Thr Met Ala Val 

125 

Pro Asn Ala Pro 
140 

Pro Ser Lys Pro 
155 

Gly Lys Leu Arg 
170 

Pro Gly Arg Glu 

Ala Glu Val Phe 

205 

Ala Gly Asp Pro 
220 

Trp Arg Leu Glu 
235 

Thr He Gin Glu 
250 

Leu Pro Lys Pro 

Ala Met Leu Pro 

285 

Leu Ser Trp Lys 
300 

Val Leu Glu Tyr 
315 

Ser Val He Met 
330 

Pro Leu Ser Ser 



95 

Gly Pro Asn 
110 

Asp Ser Gly 

Arg Phe Val 

Ala Asp Leu 
160 

Ala Gly Leu 

175 
Glu Ser Val 
190 

Glu Arg Leu 

Ser Lys Leu 

Glu Thr Gly 
240 

Arg Ser Lys 

255 
Lys Gly Gin 
270 

Asn Ala He 

Leu Thr Ser 

Glu Thr Pro 
320 

Thr He Pro 
335 

Asp Ala Ala 
350 

Val Thr Val 



Pro Val Ala Ala 

365 

Glu Cys Leu He Asp Gly Glu 
380 

Arg Ser Gin Gly 
395 

Phe Pro Asn Arg 
410 

Gly Gly Ala Thr 



Val Glu Ala Val 

445 

Ala Val Asp Pro 
460 

Pro Gin Phe Leu 
475 

Ala Leu Asp Arg 
490 

Val Ala Gly Val 

Ala Ser Gin He 

525 



Val Glu Thr 
400 

Ala Pro Asp 

415 
Asn Thr Gly 
430 

Asp Arg Asp 

Leu Val Leu 

Val Gly His 
480 

Gly Gly Tyr 

495 
Ala Leu Gly 
510 

Ser Asp Phe 



63 



